THE BELL SYSTEM 





Volume 52 J anuary 1973 Number | 


Measurements of Transfer Inefficiency of 250-Element 
Undercut-Isolated Charge Coupled Devices 
M. F. Tompsett, B. B. Kosicki, and D. Kahng 1 


Formulas on Queues in Burst Processes—I 
B. Gopinath, D. Mitra, and M. M. Sondhi 9 


A Frame-to-Frame Picturephone® Coder for Signals 
Containing Differential Quantizing Noise 
D. J. Connor, B. G. Haskell, and F. W. Mounts 35 


A DC-to-2.3-GHz Amplifier Using an “Embedding” 
Scheme G. White and G. M. Chin 53 


Coupling Coefficients for Imperfect Asymmetric Slab 
Waveguides D. Marcuse’ 63 


Batch Input to a Multiserver Queue With Constant 
Service Times A. Kuczura 83 


Crosstalk in Uniformly Coupled Lossy Transmission 
Lines J. C. Isaacs, Jr., and N. A. Strakhov 101 


Applications for Quantum Amplifiers in Simple Digital 
Optical Communication Systems S. D. Personick 117 


A Proper Model for Testing the Planarity of Electrical 
Circuits A. J. Goldstein and D. G. Schweikert 135 


Contributors to This Issue 143 


THE BELL SYSTEM TECHNICAL JOURNAL 


ADVISORY BOARD 
D. E. PROCKNOW, President, Western Electric Company 
J. B. FISK, President, Bell Telephone Laboratories 


W. L. LINDHOLM, Vice Chairman of the Board, 
American Telephone and Telegraph Company 


EDITORIAL COMMITTEE 


W. E. DANIELSON, Chairman 


F. T. ANDREWS, JR. D. GILLETTE 
Ss. J. BUCHSBAUM A. E. JOEL, JR. 
R. P. CLAGETT B. E. STRASSER 
I. DORROS D. G. THOMAS 


Cc. R. WILLIAMSON 


EDITORIAL STAFF 


L. A. HOWARD, JR., Editor 
R. E. GILLIS, Associate Editor 
J. B. FRY, Art and Production Editor 


F. J. SCHWETJE, Circulation 


THE BELL SYSTEM TECHNICAL JOURNAL is published ten times 
a year by the American Telephone and Telegraph Company, J. D. deButts, 
Chairman and Chief Executive Officer, R. D. Lilley, President, J. J. Scanlon, 
Vice President and Treasurer, R. W. Ehrlich, Secretary. Checks for subscrip- 
tions should be made payable to American Telephone and Telegraph Company 
and should be addressed to the Treasury Department, Room 2312C, 195 Broad- 
way, New York, N. Y. 10007. Subscriptions $10.00 per year; single copies 
$1.25 each. Foreign postage $1.00 per year; 15 cents per copy. Printed in U.S.A. 


THE BELL SYSTEM 
TECHNICAL JOURNAL 


DEVOTED TO THE SCIENTIFIC AND ENGINEERING 
ASPECTS OF ELECTRICAL COMMUNICATION 


Volume 52 January 1973 Number 1 


Copyright © 1973, American Telephone and Telegraph Company. Printed in U.S.A. 


Measurements of Transfer Inefficiency of 
250-Element Undercut-Isolated 
Charge Coupled Devices 


By M. F. TOMPSETT, B. B. KOSICKI, and D. KAHNG 
(Manuscript received August 10, 1972) 


A 250-element charge coupled device 1s described in which the transfer 
electrodes are delineated and isolated using an undercut-etch technique. 
The device has metal electrodes on two thicknesses of oxide and ts primarily 
intended to be operated in a two-phase manner. Measurements of transfer 
inefficiency as a function of frequency have been made on both n- and 
p-channel devices. Below 1 MHz, values of 4 X 10~* per transfer inde- 
pendent of transfer frequency have been obtained. Above 1 M Hz the transfer 
inefficiency progressively rises as the dynamics of charge motion limit the 
transfer of charge. 


I. INTRODUCTION 


A new method of fabricating charge coupled devices! (CCD) using 
the technique of undercut isolation has been reported recently.? A 
schematic cross section of a device made using this technique is shown 
in Fig. 1. The essential feature is a method of forming electrically 
isolated but self-aligned metal electrodes on two thicknesses of oxide. 
By connecting the electrodes in pairs, which may be done externally 
or using electrochemically plated regions on the device, as shown in 
Fig. 1, a two-level oxide structure** that may be operated in a two- 
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Fig. 1—Schematic longitudinal cross section of an undercut-isolated stepped-oxide 
D. 


phase mode is obtained. This structure has several advantages over 
other structures. For example, compared to the three-phase structures, 
the geometrical constraints of having three phases, the requirement to 
fabricate 2- to 3-um gaps, and the instabilities associated with the 
exposed oxide in these gaps are removed. Compared to other two- 
phase structures,*~> there is no need for a refractory metal technology 
or ion implantation, and the packing density of elements can be higher. 

Test devices using undercut isolation and 250 elements long have 
been fabricated, and their transfer inefficiencies measured. The increase 
in number of elements from an earlier device? has allowed the small 
values of transfer inefficiency inherent in this structure to be measured 
accurately. 


II. DEVICE FABRICATION 


The 250-element CCD, which is the subject of this paper, was 
fabricated using the same photolithographic masks, except for two, 
as an earlier 500-element three-phase device® so that the undercut- 
isolated structure could be quickly evaluated. A photograph of one 
end of a finished device is shown in Fig. 2. The transfer region with the 
alternate thin and thick oxide levels is seen in the center of the photo- 
graph. The transfer electrodes are connected alternately on either side 
directly to two metal buses and via diffused cross-unders to two other 
buses. These cross-unders are not necessary for a two-phase CCD but 
were retained from the earlier three-phase device design to enable 
four-phase operation to be carried out for experimental purposes. 
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Fig. 2—View of the output section of a finished 250-element device, showing output 
diode D, output gate G, electrodes E1 and E83 over thick oxide, and electrodes E2 
and 14 over thin oxide. 


The transfer region is defined laterally by a ‘‘channel-stopping”’ 
diffusion that enhances the substrate doping. A device made exactly 
to the mask dimensions would have transfer electrodes that were 11 
um long over the thin oxide and 7 um long over the thick oxide, with 
an 18-um-wide channel. The devices were fabricated as described in 
an earlier paper? on both n- and p-type substrates. 


III. MEASUREMENTS OF TRANSFER INEFFICIENCY 


In order to measure the performance of the devices, voltages and 
pulses appropriate for either p- or n-channel devices, and two- or 
four-phase modes of operation were provided. Owing to circuit limi- 
tations, negative square pulses for driving p-channel devices up to 
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frequencies of 10 MHz, and positive pulses up to 2 MHz, for testing 
n-channel devices were available. However, the n-channel devices 
could also be driven at up to 7 MHz using sinusoidal drive. As has 
already been mentioned, the device was made by modifying the design 
of an existing device, which had a very narrow transfer channel, so 
that the size of the output signal was not as large as is really desirable 
for easy and accurate measurements of transfer inefficiency. With a 
pulse voltage of 20 V, the maximum size of a charge packet was 0.5 pC. 
In all the measurements, a background charge was injected into the 
device so that all the elements carried a small charge so as to keep the 
interface states filled. Varying the amount of background charge in 
the range from 20 to 80 percent of a full charge packet caused no 
appreciable change in the measured transfer inefficiency. 

At frequencies up to 2 MHz, the transfer inefficiency « was measured 
by periodically injecting a single packet of charge into the device and 
observing the sequence of charge packets that emerged. The injection 
of charge was done either optically with a small light spot projected 
through a microscope or electrically. An advantage of the optical 
method is that the light spot could be moved near the output and the 
form of the output signal for a small number of transfers could be 
established. Also, by moving the spot along the device and observing 
the output signal, any discontinuities in transfer efficiency at a region 
of poor transfer, possibly caused by a partially blocked channel or 
an open electrode, could be detected and the device rejected. Obtaining 
a numerical value for e from the observed sequence of output charge 
packets is based on comparison with the expected sequences’:® for 
different values of transfer inefficiency product ne, where 7 is the 
number of transfers. 

Particularly for measuring values of ne > 1, it is more accurate to 
use another technique in which a sinusoidal input at different fre- 
quencies is fed to the device and the amplitude of the output measured. 
The frequency response of the device corrected for the response of the 
output amplifier is then plotted. Comparison with the theoretical 
response curves’ enables values of ne to be obtained. The advantage 
of this method is that values of transfer inefficiency for high values of 
drive frequency f, could be obtained using input signals and amplifiers 
with bandwidths much lower than the drive frequency f,. 


IV. MEASURED VALUES OF TRANSFER INEFFICIENCY 


A plot of transfer inefficiency versus frequency for both n- and 
p-channel devices operated in the two-phase mode is presented in 
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Fig. 3—Measurements of transfer inefficiency (per transfer) for both n- and p- 

channel undercut-isolated CCDs. The theoretical values for p- and n-channel 


devices assuming a 9-um field free region under the transfer electrodes on the thin 
oxide have also been plotted. 


Tig. 3. The transfer inefficiencies for both the p- and the n-channel 
devices, as predicted by calculations of charge motion® for devices 
with mobilities of 200 and 400 cm? V—'s™ respectively, are also shown 
on the figure. A 9-um-long field free region is assumed under each 
electrode on the thin oxide, since the electric fields from the neigh- 
boring electrodes will penetrate at each end of the electrode. 

Referring to Tig. 3, the transfer inefficiency of the devices appears 
to be flat below 0.5 MHz, perhaps due to limitations caused by inter- 
face states. Above 0.5 MHz, the transfer inefficiency progressively 
degrades until, for the p-channel device, it rises exponentially following 
the theoretical curve. The rounding of the experimental curve is due 
to the joint contributions of the interface states and the dynamics of 
charge transfer. The greater carrier mobility in the n-channel devices 
is reflected in the lower transfer inefficiencies of these devices at 
frequencies in excess of 1 MHz. 


6 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1973 


PICTUREPHONE 
RESOLUTION CHART 


DIRECT DELAYED 





Fig. 4—Showing the use of a 250-element n-channel undercut-isolated CCD to 
delay a Picturephone® video signal by 121 ys. The direct (undelayed) display is seen 
on the left and the delayed display on the right. 


The transfer inefficiency per transfer obtained on the p-channel 
device operated in the four-phase mode at 1 MHz was 5 X 1074, 
which is about the same as that obtained in two-phase mode. However, 
in the two-phase mode, only half the number of transfers are required 
for a CCD with the same number of elements so that a device operated 
in this way has half the transfer inefficiency product ne of one operated 
in four-phase mode. This is an important observation, not only because 
of the improved performance, but because of the additional advantage 
that two-phase interconnection gives to the design of functional 
devices. 

The p-channel device was used, as described in more detail else- 
where," to delay a Picturephone® video signal by 121 us with barely 
noticeable degradation in the display as shown in Fig. 4. 


Vv. CONCLUSIONS 


The structure described has given transfer inefficiencies which are 
more than adequate to permit the design of devices for many appli- 
cations. The two-level oxide structure with electroplated intercon- 
nections leads to some relatively simple designs of devices for various 
applications. The active region of the device is fully protected with a 
double-layer oxide and there are no exposed regions of oxide which 
can charge up and degrade the performance of the devices. There is 
no need for refractory metal electrodes and high-temperature processing 
to obtain a good second-level dielectric layer for insulation, or for fine 
features to be etched in the metallization as required in other struc- 
tures. In addition, there is no critical reregistration required in the 
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cell, so that compared to other structures, a smaller cell may be 
fabricated given the same fabrication tolerances. This would lead to 
higher packing densities and a capability of operating at higher 
frequencies. An encapsulant may be required to protect the undercut 
regions from dirt, damage, and electrical breakdown. 


VI. 


ACKNOWLEDGMENTS 


We acknowledge the technical assistance of D. A. Brooks, P. M. 


Ryan, and E. J. Zimany, Jr., in making the measurements. 


REFERENCES 


1. 


Oo oo Rw 


10. 


11. 


Boyle, W. 8., and Smith, G. E., “Charge Coupled Semiconductor Devices,” 
B.8.T.J., 49, No. 4 (April 1970), pp. 587-593. 


. Powell, R. J., Berglund, C. N., Clemens, J. T., and Nicollian, E. H., “A Two- 


Phase Stepped Oxide CCD Shift Register Using Undercut Isolation,’’ Appl. 
Phys. Ltrs., 20, 1972, pp. 4138-414. 


. Kahng, D., and Nicollian, E. H., U. 8. Patent Number 3651349. 
. Kosonocky, W. F., and Carnes, J. E., ‘Charge Coupled Digital Circuits,’? IEEE 


J. Solid-State Circuits, SC-6, 1971, pp. 314-322. 


. Krambeck, R. H., Walden, R. H., and Pickar, K. A., “Implanted Barrier Two- 


Phase Charge Coupled Device,’’ Appl. Phys. Ltrs., 19, 1971, pp. 520-522. 


. Bertram, W. J., Sealer, D. A., Séquin, C. H., and Tompsett, M. F., “Recent 


Advances in Charge Coupled Imaging Devices,” IEEE INTERCON Digest 
of Papers, 1972, pp. 292-293. 


. Joyce, W. B., and "Bertram, W. J., ‘Linearized Dispersion Relation and Green’s 


Function for Discrete Charge Transfer Devices with Incomplete Transfer,” 
BS.T.J., 50, No. 6 (July-August 1971), pp. 1741-1759. 


. Tompsett, M. F., “Charge Transfer Devices,” J. Vac. Sci. Tech., 9, 1972, pp. 


1166-1181. 


. Strain, R. J., and Schryer, N. L., “A Nonlinear Diffusion Analysis of Charge 


roped Device Transfer,” B.S.T.J., 50, No. 6 (July-August 1971), pp. 1721- 

1740. 

Tompsett, M. F., “The Quantitative Effects of Interface States on the Perform- 
ance of Charge Coupled Devices,” International Electron Devices Meeting, 
Washington, October 1971. To be published, IEEE Trans. Electron Devices, 
January 1973. 

Tompsett, M. F., and Zimany, E. J., Jr., “Use of Charge Coupled Devices for 
Analog Delay,’’ ISSCC Digest of Technical Papers, 1972, pp. 136-137. To be 
published, IEEE J. Solid-State Circuits, April 1973. 


Copyright © 1973 American Telephone and Telegraph Company 
Tue Beuu System TECHNICAL JOURNAL 
Vol. 52, No. 1, January, 1973 
Printed in U.S.A. 


Formulas on Queues in Burst Processes—I 


By B. GOPINATH, DEBASIS MITRA, and M. M. SONDHI* 
(Manuscript received July 11, 1972) 


Queues arising in buffers due to etther random interruptions of the 
channel or variable source rates are analyzed in the framework of a single 
switched system. Examples of systems to which the results of the paper 
may be applied are: multiplexing of speech with data in telephone channels 
and, in certain instances, buffering of data generated by the coding of 
moving images in the Picturephone® system. The switched system con- 
sists of a uniform source, buffer, switch and channel. The source feeds 
data to the buffer at a uniform rate. The buffer’s access to the channel 1s 
controlled by the switch; of the switch is closed, the buffer empties to the 
extent of the channel’s transmission rate. The on-off pattern of the switch 
is indicated by a O — 1 burst process {E;}, j = 0, 1, 2, ---; of E; = 0, 
the switch ts closed for the duration [j, 7 + 1). The burst phenomenon is 
introduced to account for two different processes responsible for the event 
E; = 0. There are relatively long periods during which E; = 0 uniformly, 
and the activity separated by such periods ts defined to be a burst. During 
a burst, HE; = O only infrequently. The duration of a burst is an tnde- 
pendently distributed random variable with a geometric or weighted sum 
of geometric distributions. The inter-burst periods are assumed to be 
sufficiently long for the buffer to empty at some point during these pertods 
of inactivity. During a burst {E;} is a Bernoulli sequence of independent 
random variables. 

Exact expressions for a variety of performance functionals related to 
the system described above are obtained, together with qualitative results. 
Recursive formulas are obtained for the following: (i) steady-state distri- 
bution of buffer content for a finite buffer of size N; (at) mean time for 
first passage across a level N; (1) the probability of overflow, for a given 
level N, during a burst; (wv) mean time for first passage across a level N 
during a burst. The recursion in each case 1s with respect to N. The 
asymptotic behavior of the main recurstons 1s determined. 


* The sequence of names was determined by coin tossing. 
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I, INTRODUCTION 


A convenient framework for an unified analysis of a variety of 
digital communication systems involving buffering—some are discussed 
later—is provided by the system in Tig. 1. The source emits data uni- 
formly at the rate of one symbol per unit time. The transmission rate 
of the channel is (k + 1) symbols per unit time where k is some positive 
integer. The buffer has access to the channel only when the switch 
is closed. The switch is controlled by a burst process {£;}, 7 = 0, 1, 
2, ---. H;, for every J, is either 0 or 1. If E; = 0, the switch is closed 
for the duration [7, 7 + 1); otherwise the switch is open. The burst 
process is introduced to account for cases where two basically 
different types of phenomena are responsible for the event E; = 0. 
There are relatively long periods during which £; = 0 uniformly; 
the activity separated by such periods is defined to be a burst. On 
the other hand, during a burst, #; = 0 only infrequently. The dura- 
tion or length of a burst is a random variable. It is assumed that 
the burst length is independently distributed with a geometric or 
a weighted sum of geometric distributions. The interburst periods 
are assumed to be sufficiently long for the buffer to empty during 
these periods. The statistical assumption made in the paper about 
the controlling sequence {/#;} within a burst is that it is a Bernoulli 
sequence of independent random variables and Pr{#; = 1} = a where 
0 < a < 1. In a companion paper, the case where {£;} is first-order 
Markov will be considered. 

Important aspects of various digital communication systems are 
subsumed within the framework of the system described above. 
Diverse schemes for multiplexing data with speech on telephone 
channels! are representative of one class of such systems. A summary 
of the main features of the system which has been described in some 
detail in Ref. 1 follows. The central idea is to utilize the telephone 
channel during the periods of silence in speech which amount to as 
much as half of the total conversation period to transmit digital data. 
The speaker needs to have priority for the use of the channel since 
otherwise the quality of speech is impaired. #; = 0(1) corresponds to 
the decision that silence (speech) exists during the interval [j, 7 + 1) 


UNIFORM 
SOURCE BUFFER O O CHANNEL 
SWITCH 


Fig. 1—Switched communication system. 
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so that only after it is decided that the speaker is silent does the 
buffer have access to the channel. An excellent example of the burst 
phenomenon may be found in speech monologues. Due to the presence 
of phrases in speech, two types of silences, interphrase and intraphrase 
silences, exist ;*:* the former type consists of silences no less than 250 
ms long and this is substantially longer than the mean duration of 
(uninterrupted) intraphrase silence. 

There exists another class of digital communication systems com- 
posed of systems with only one source which transmits at a nonuniform 
rate. Most of the time the source rate is less than, say, 7» bits per unit 
time and 7p is less than the channel rate r. Occasionally, for short 
periods of time, the source rate spurts to a level ri: which exceeds r. 
During such periods buffering becomes necessary. These occasional 
bursts of overloading of the channel are indicated by the {£;} process. 
The relation to the switched communication system of Fig. 1 is clear 
if (r: — r) is normalized to unity, and (r — ro) corresponds to k. 

An example of such a system for which the analysis of this paper is 
relevant arises in buffering of data generated by the coding of moving 
images in the Picturephone® system.® In this case, of course, 7) and 
r, should be interpreted as average rates® in the two regimes, or, when 
the worst case is of interest, as the extreme rates. The results of this 
paper appear to be relevant’ for variable rate in-frame coding, since 
during bursts of high detail, the correlation of the data rates for 
successive picture elements is not high. For frame-to-frame coding 
the first-order Markov model, to be treated in a companion paper, is 
of interest. 

Exact expressions for diverse performance functionals related to 
the system in Fig. 1 are obtained, together with qualitative results. 
As a whole they provide a rather comprehensive set of criteria for the 
design of the important parameters of the system, such as the buffer 
size and the transmission rate of the channel. A summary of the main 
contributions follows.* 

(z) A recursive formula is obtained for the steady-state distribution 
of buffer content for finite buffers. The recursion is with respect to N 
where N is the size of the buffer. 

(zz) It is proved that Fy, the mean time for first passage through a 
level N, is given by 


1 l-—qwr 
Fy = —Fy-1 — 
T 





] 
Fy—sz-1 + ~-: 
T 


T 


* N is used to denote both the buffer size [as in (2), (cz) and (vz) ] and a level [as 
in (zt) and (iv) ]. In what follows, the definition of N should be clear from the context. 
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(t77) gn, the probability of overflow of a buffer of size N during a 
burst, the duration of which is distributed geometrically with a param- 
eter p is given by 


( 1 1 1—-—@fr 1 i 
Qn = —_—e —_— . . 
pw Jn-1 T JN-k-1 


(iv) A closed expression and a recursive formula are obtained for the 
mean time for first passage through a level N during a burst; the 
recursion is with respect to N. 

(v) The asymptotic behavior of all formulas in items (2) through 
(277), as N becomes large, is given. The derivation is dependent on the 
following: of the roots of the polynomials associated with the recur- 
sions, either one or two roots, depending on which recursion is being 
considered, lie outside the unit circle. 

(vt) It is proved that under certain conditions on the initial probabil- 
ity distribution of the contents of the buffer, the probability of a 
buffer being full is a monotonic, nondecreasing sequence with respect 
to time; if the buffer is initially empty, the above-mentioned conditions 
are satisfied. One of the main implications of the result is that the 
steady-state probability of the buffer being full is an upper bound 
on the probabilities of the buffer being full at any instant. Furthermore, 
a particularly simple recursion is obtained for Py, the steady-state 
probability of a buffer of size N being full: 


(- 1 bese sal ) 
Py = —_—e _— . . 
Tw Pea us Py-x1 


(Observe that 1/Py is also the mean time for recurrence of the state 
corresponding to a full buffer.) | 

The closed expressions obtained are for all k and N, and the recur- 
sions hold for all N = 2k + 1. Wherever applicable, the buffer is 
assumed to be initially empty. An important feature of the given 
formulas is that they are also given in the form of recursions. The 
advantages of recursive formulas over the alternate versions need to be 
emphasized. Fora given N, typically, a closed expression for a recursion 
involves inverting a matrix of order N. For large N, the effort is 
substantial. If, in addition, it is borne in mind that a designer is 
interested in functionals associated with a range of possible buffer 
sizes, the advantages of recursive formulas of the form given in this 
paper become overwhelming. This is only to be expected since the 
recursions are obtained by taking into full account the structure of 
the matrices involved. 
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II. EQUATIONS OF PROCESSES 


Let B; be the number of symbols in the buffer at the jth instant. 
For a finite buffer of size N, 
Bjyy = Max{B; —k,0} if £; 


0 
1; 


Since B; depends only on B;_1 and E;_1, the state of the Markov chain 
of interest at the jth instant, S;, is determined by the value of B; 
where B; € {0, 1, 2, --- , N}. Let P,».(n) denote the probability of the 
state Sm = n. Then 


Pr(0) = (1 — 4) D Pm—a(9) (1) 


P(t) = tPmadi — 1) + 1 — 2) Pailé + k) 
i=1,2,.-,N—k (2) 
P(t) = tPm ali — 1) 
. i=N-k+1,N—-k4+2,---,N—-1 (3) 
P(N) = t[Pma(N — 1) + Pna(N)]. (4) 


It is well known from the theory of Markov chains ® that the limiting 
distribution of the states P(z) is obtained from (1) through (4) by 
equating P,(z) and P,-1(2) to P(t). 

2.1 Equations for Some New Probabilities 
Central to most of what follows are the probabilities Q(z), where 
Qm(t) = Pr{ (Sm = 1) O (Bj SN, 7 S m)} 


and the buffer size exceeds N. For convenience, let Xm denote the 
event S; € {0, 1, 2, ---, N} for all 7,0 S$ 7 S m, so that 


Qm(t) = Pri (Sm = 1) V Xn}. (5) 


The equation governing the transitions of {Q;} is derived. It is shown 
that there exists a matrix A which relates {Q;:} to {Q:-1}, i.e., 


Q:(9) = 2 AjiQi-1(1) (6) 


or, in matrix notation, Q; = AQ:-1. 
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In (5) 7 € {0, 1, ---, NW}, so that 
Oni) = Pri Se=— 1) (\ 2444} 
N 
= om Pr{ (Sm = t) (\ X m1 C) (Sim—1 _ ie 
j=0 
Hence, 


Qm(t) = dD Pr{ (Sn = 2) | (Spa = J) 0° Xm-1}Pr{ (Sa = JO X m1} 


DL Prt (Sm = 2) | Sma = 9) OV Xm1}Qm—1(J) 


CL 9) x Qni(j) if 7=0 (7a) 
TQm1(i = 1) + (1 — 7)Qnalé + ke) 
7 if ¢=1,2,---,N—k (7b) 
TQm—1(t — 1) 
if ~=N—-kK+1,N—k+2,---,N. (7c) 
(7) defines the (VN + 1) by (N + 1) matrix A. Sometimes when the 


need arises, the (V + 1) by (N + 1) matrix A associated with a given 
N will be specified by A(N). 
1 2 k+1 k+2 N+1 


(l—7) (I —7)-:-(1 —7) 1 
7 0 Chem). 2 


Pineal We ue 


A=  , (8)* 


"Sg ° 10 N 
T 0 N+1 


It will be observed that the only difference between (7) and the 
transition equations (1) through (4) for a finite buffer of size N, is 
that eq. (4) is modified since a transition from state N to state N is 
not possible in the present context. For the same reason, the matrix A 
is not a Markov matrix since the sum of the elements of the last, 
ie., (NV + 1)th column is (1 — 7), the remaining columns sum to 
unity as is the case for all columns of Markov matrices. 


* The dots indicate continuation of the values of the adjoining elements ; remaining 
elements are assumed to be 0. 
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If the transition matrix of the basic Markov process, i.e., the matrix 
defined by eqs. (1) through (4), is irreducible, then (I — pA) is 
nonsingular for |p| < 1. The proof follows from a well-known result 
in matrix theory’ which in this case states that if all the columns of 
(I — pA) are weakly column-sum dominant and at least one column 
of (I — pA) is strongly column-sum dominant, then the matrix is 
nonsingular. 


III, STEADY-STATE PROBABILITIES FOR FINITE BUFFERS 


In this section, a formula is given for recursively generating the 
steady-state probabilities P(z) where the recursion is, with respect to 
N, the size of the buffer. To distinguish the steady-state probabilities 
for different buffer sizes, the symbol P%(z) is introduced to denote 
P(¢) for a buffer of size N. 

If N = k + 1, as is almost always the case, an equation of the type 
given in (2), namely, 


seal Sy ree (9) 





PN(¢ — 1) —=P¥(i) + 


occurs at least once and since N > k usually, the main body of equa- 
tions defining the steady-state probabilities is of that form. It is 
proved in Ref. 1 what may reasonably be expected, namely, every 
solution of the homogenous set of equations that define the steady- 
state probabilities is of the form 


re k+1 N-j : 
P (j) = = Dima LS 0, 1, oteg N (10) 
t=1 


where y; are the simple roots of the polynomial 


1 l-—qr 
ptt — yk + 
T 





(11) 


T 


If the polynomial has multiple roots the obvious modifications must 
be made. [Note: Since 0 < z < 1, the polynomial in (11) has distinct 
roots whenever 7 # k/(k + 1); when z = k/(k + 1), the only repeated 
root is 1. ] 

The complete recursive formula for P’(j) is obtained in two parts. 
First, a recursive formula for a set of solutions gy(j) to the steady-state 
equations is obtained and, second, a recursive formula for the 
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normalizing constant Zy is obtained. Finally, 


1 
PNG) = 5) j= 0, 1, neg lV. (12) 
N 


3.1 Recursions for {qn(j)} 


Let 
qn(N) = 1 (13) 


and suppose {qw(j)} satisfies the steady-state equations of a finite 
buffer of size N. Hence, g¥ (7) has the form given in (10).* For fixed NV 
andz = 1, 2,---,k +1, let 


k+1 ug 


A 
di = Do ajm; 


j=l 


The transformation {a;} — {d;} is invertible since the Vandermonde 
matrix is nonsingular. Now, 





kl N—(N-#+1) 
di = DY ayy; 
j=l 
= qn(N —i+1) 4=1,2,---,k+1. (14) 
Also, from the steady-state equations themselves, 
di = qn(N) = 1 (15) 
l- 7 
di = qn(N — i+ 1) = — $=2,3,--k+1. (16) 
TT 


Hence, significantly, {d;} is independent of N from which it follows 
that {a:} is also independent of N. 


k+l 
s a 
t=1 
k+1 1 _ 1 — TT _; 
= 2 4; eee tee us : from (11) 
i=1 vis 
1 | 
= —qnik(j) — 
Tv 


Qn+n+i(J) 





— 7 
qn(J) j = 0, hs eae IN (17) 





The formula for {qv+41(j)} is complete if (15) and (16) are appended, 


* To distinguish between {P%(j)} and {qn(j)}, denote the coefficients in the form 
for the latter by {a:}, 1.e., 6; = (1/Zw)ai. 
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1.€., 
l—-r 
Qn+nyi(N a 1) a S pack = 1, 2, ae k (16) 
T —t-+1 
Qnenti(N +h +1) = 1. (15) 


3.2 Recursion for the Normalizing Constant 








Let 
eo : 
Zy = D ayn(J). (18) 
j=0 
Now ‘ 
N+k+1 k 74\i 
=. ea Tae () 
Me ot Nae 
j=N+1 1 (19) 
1 
= oe 
Summing both sides of (17), 
1 1 1 l—qr 
ZN+k+A = = -| 2 = a = 2N 
1.€., 
1 l-—fr 
Dy. (20) 


UN+kEL = —ZN4k — 
ra 


(20) is the recursion for the normalizing constant. The derivation of the 
recursive formula for {P%(7)} is now complete. 

Observe that in the course of the above analysis, a simple recursive 
formula for the rather important steady-state probability of the buffer 
being full, i.e., P’(V), has been obtained. 
qn(N) i 


= (21) 


P(N) = 
(NV) as = 





and 2y, of course, is obtained from (20). 


IV. MEAN FIRST PASSAGE TIME 


Suppose JN is a fixed positive integer and the buffer capacity is 
greater than N. A functional that provides substantial insight into 
the problem of designing a buffer for which the probability of overflow 
is small is Fy, the mean time required for the buffer contents to first 
exceed NV. It is particularly useful in the context of burst processes 
where only incomplete data are available concerning the burst length 
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distribution—provided that the length of bursts is bounded, a simple 
comparison of the bound with Fy provides an useful guide. In this 
section a recursive formula for Fy, the recursion being with respect to 
N, is obtained. To correspond with the practical situation, the buffer 
is initially assumed to be empty; the same recursive formula holds for 
the other interesting initial condition, namely, the buffer initially 
contains an unit symbol. 
Xm is the event that S; € {1,2,---,N} forall 7,0 S$ 7 Sm. 


0;4 Pr{ overflow occurs for the first time at 7} (22) 


Pr{ (Ai = 1) 0 (Sin = N)O X31} 
a Pr{(Si1 = NC X;_1)}, from the independence of {E;} 


II 


= 7Q:-1(N) (23) 
where {Q;} is as defined in eq. (5). It has been shown in Section 2.1 that 
Q: = AQ:-1. (6) 
Hence, 
0; = 7Qi-1(N) 


7(0,---, 0, Qi-1 
x (0, PPE AO; 1)A*Qo 


= re,At1Qq (24) 


where e; denotes the vector* with a single element equal to unity at the 
ith location and all remaining elements 0; r = N + 1. Finally, 


Fy = Mean time for first passage through level N 


lI 
Ms 


Ms * 


10; (25) 


7 


of). 
1€,A*"Qo 


=F 
i=1 


is ( > ia) Qo 


i=1 


= we(I — A)-“\(I — A)“Qo- (26) 


Let 
ate ef(I — A) 


* The superscript ¢ denotes the transpose of a vector. 
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so that 


at(I — A) = e, (27) 
But 
1 
z= — (1, Peeve) (28) 
Tv 
is a solution of (27) since the elements of the last, i.e, (N + 1)th 
column of A sum to (1 — 7) and the remaining columns sum to 1. 
Moreover, (28) is the unique solution of (27) since (J — A) is non- 
singular. Hence, 


Fy = (1,1 ---, Dd — A)*Qo 
= AAO (29) 


where 1 denotes the vector with all elements equal to unity. In the 
following section, the above formula with Qo = e: is analyzed further 
to yield a recursive formula. 


4.1 Recursive Formula for Fy 


Henceforth, it is necessary to be specific about the dimensions of 
A-the matrix A associated with a given N is denoted by A(N). 
The buffer is assumed to be initially empty, i.e., So = 0 or, equiv- 
alently, Qo = é1. 

Since* |I — A(N)|[I — A(N)]}e1 is the vector of (signed) 
cofactors of the Ist row of [J — A(N)] 


[I — A(N)| VO — AWW) ye. = |DW)| (30) 


where D(N) is the (VN + 1) by(N-+1) matrix obtained from A(N) by 
replacing all elements of the first row of A(N) by unity. Then, from 
(29), 


|D(N) | 
kt Seer earth (31) 
|I — A(N)| 
Adding rows 2, 3, ---, (NV +1) of [J — A(N)] to the first row, 
it can be verified that 
|\I — A(N)| = «Xt, (32) 


In Appendix A it is shown that 
|D(N)| = |DW — 1)| — Ad — a)r*|D(N —k—1)| + 2%. (83) 


* |X| denotes the determinant of the matrix X. 
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Hence, 
|D(N) | 1 |D(N —1)| (1 — 7)r* |D(N —k — 1) 1 
2 i ee — ee EE, on 
T vg T wht aN-k rg 
1.e., 
1 l-—qw 1 
Fy = —Fy_1— Pye (35) 
T ve 


The above relation is the desired recursive formula for the mean first 
passage time. It was obtained under the assumption that the buffer 
is initially empty. An alternative assumption about the initial distribu- 
tion, which is also of interest, is that the buffer contains an unit symbol, 
1.e., So = 1. It may be shown that even for this case the mean first 
passage time satisfies the formula (35) though, of course, the initial 
conditions to the recursion in the formula are different. 


V. PROBABILITY OF NO OVERFLOW IN A BURST 


The results of this section are useful when information concerning 
the length of bursts is available. It is assumed that the distribution of 
burst length may be expressed as a weighted sum of geometric distribu- 
tions. Given below are formulas which yield the probability that the 
contents of the buffer during bursts do not exceed N, a given positive 
integer. 

At this stage, assume that the distribution of burst length is geomet- 
ric; the generalization to distributions that are weighted sums of 
geometric distributions will be taken up later. If the burst length is 
denoted by J, then 


Pr{l _ a} = (1 =, p)p*? t= 1, 2, ears (36) 


for some p,9 <p <1. Let Gy 2 Pr {buffer contents do not exceed N 
during a burst}. The usual decomposition into mutually exclusive 
events yields 


Gy= > Pr{8; & (0, 1, set IN, 7 = 0,1, +++, m; 
m21 
and burst length = m} 
= > Pr{Xn Ol = m} 
m21 
= D> Pr{Xn}Pr{l = m}. (37) 


m21 
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The last relation holds since, Pr{X|l = m} = Pr{ Xn}. Now, 


Pr{Xn} = > Pr{Sn =i Xm} 


N 

2d Qm(2) from (5), 

14Qn 

= 14A”Qo (38) 


where A is the (V+ 1) by (N + 1) transition matrix defined in 
Section 2.1 and Q» is the vector given by the initial distribution—it 
may be assumed that So € (0, 1, ---, N). Hence 


Gy = d 1A™Qo(1 — p)p™™ 


m=1 


- me fx (oA)"} Qn 


m21 


I 





Lp 
= eres — pA) — I}Qo 


1— 
= st {1°(I — pA)“Qo — 1}- (39) 


In the sequel, a recursive formula for Gy is developed for the case 
where So = 0 or, equivalently, Qo = 1. 


5.1 Recursive Formula for Gnu 


The matrix A associated with a given N is denoted by A(N). 
|Z — pA(N)|{I — pA(N)}-'e: is the vector of (signed) cofactors of 
the first row of {[—pA(N)}. Therefore, |[—pA(N)|1‘{1—pA(n)}“e1 
is the determinant of the matrix B(N) obtained by replacing every 
element of the first row of {J — pA(N)} by unity. 


BW) 
[I — pA(N)| 
Let the (signed) cofactor of the element {J — pA(N)}1; be denoted by 
Cll, 7 = 1,2, ---,N +1. From the definition of B(N), 


|B(N)| = X CM. (41) 


t=1 


1*{1 — pA(N)}e1 = (40) 
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The elements of the (VN + 1)th column of {J — pA(N)} sum to 
{1 — p(1 — 7)} and the elements of the remaining columns sum to 
(1 — p). Hence, by adding the rows 2, 3, ---, N + 1 torow1, it follows 
that 


[I — pA(N)| = 1 — p) ON + {1 — pl — 7)}On 


N+1 


= (1-9) ¥ C+ {1—p(l— 2) - 1 —p)} 01H 


= (1 — p)|B(N)| + pxCt-*4 from (41) , 


|B(N) | 1 pt Ci N+1 
|I-— pA(N)| 1l—p 1-—p|I— pA(N)| 
Recapitulating, 





(42) 


Lp 








Gy = [1'{1 — pA(N)}e: — 1] from (39) 
p 
1 —p |B(N) | 
= ire seca en f 
: | iT — eA) | rom (40) 
CL N+H 
=]- "TT — pA(N)| from (42). (43) 


The remainder of the derivation is in two parts. First, a closed form 
expression for C!'+1 is obtained. The second part is on the recursive 
formula for | — pA(N)| and this formula is derived in Appendix B. 


1] 23 k+1k42 N+1 
(EER hee ® 1 
Be 1 0 Om 2 
I — pA(N) = 7 ce a (44) 
“10 » |N-—k+1 
x» 1 O N 
wo od N+1 
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where —p(1 — 7) = \ and —pz = u. C'%+1, the (signed) cofactor to 
{I — pA(N)}1,n41, is the signed determinant of an upper triangular 
matrix; 
CLN+1 = (—1)¥42y% — (—1)%*#2(—pr)*¥ 
= pNa, (45) 


In Appendix B it is shown that if zy, a scalar, is used to denote 
|I — pA(N)|, then the following recursive formula holds: 








UN = XLN-1 — petrk(1 == w)LN—k—-1- (46) 
Hence, 
oN, (=>) ae (Ss) (47) 
aN pN rp \eN~lpN-1 ae aN—k-1pN—k—1 
Let 
x |I — pA(N)| 
8 Se Poel as 
Tp Tp 
so that, 
1 l—@wr 
Yn = — Yn-1 — YN—k-1. (49) 
Tp 
From (48) and (45), 
aNt pW 
Gy = 1 -—- ————_— 
[I — pAW)| 
1.€., 
Tv 
Gy =1-—- (50) 
YN 


(49) and (50) together provide the desired recursive formula for the 
probability that the contents of the buffer does not exceed a given level 
N during bursts if the buffer is initially empty and the distribution of 
burst lengths is geometric. 

Suppose the distribution of burst lengths is the weighted sum of 


geometric distributions; 1.e., 
J 


Pr{burst length = 7} = }° a;(1 — p;)(p;)*. (51) 
j=l 
It may then be shown that Gy = 72104, vy Where G;,n is obtained 
from (50) and (49) with p replaced by p; in the latter equation and 
G;,n identified with Gy. 
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VI. MEAN TIME FOR FIRST PASSAGE IN A BURST 


In Section IV certain formulas for the mean time for first passage 
across prespecified levels are given. In burst processes where data 
regarding the length of bursts is available, a more meaningful functional 
is one in which a level is defined to be crossed only if this event occurs 
during a burst. Bursts, then, may be visualized as a period of observa- 
tion of the buffer. First passage across N, a positive integer, is defined 
to occur at 2 if 


{S; 3 N,j =0,1,2,---,7-—1landS; = N+ 1 and, 

burst length =}. 
Let R; denote the probability of this event. The functional of interest 
is Hy = >0;.,1R,. The burst length distribution is assumed to be 
geometric ; generalization to larger classes of distributions may be un- 


dertaken as indicated in the preceding section. Hence, if | denotes 
burst length, 


Pr{l = 7} = (1 — p)p*? +1 =1,2,--- (52) 
for some p, 0 < p <1. 
In the notation of Section 2.1, 
R: = Pr{Si1 =NOA XUN Bai = 1 O12 2} 
= Pr{Fy1 = 1})Pr{Sia = NO X-ill = i} Pr{l = 2} 
=mPr{S_1 = NO X-1}Pr{l = 7} 
=7Q:-1(N)p*" 


= re,(pA)1Qo. (53) 


A is, of course, the (V + 1) by (N + 1) matrix defined in Section 2.1 
and Qo is the initial condition vector. 


Hwy = 7e(X t(pA)*)Qo 
721 
1.e., 


Hy = re,(I — pA)“(I — pA)“Qo. (54) 


The above concludes the derivation of the closed formula for H y—the 
rest of the section is concerned with recursive versions of the formula 
for the case where the buffer is initially empty, i.e., Qo = e1. Once again, 
it is necessary to revert to the use of the symbol A(N) to denote the 

matrix A associated with N. | 
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For fixed N, 
2(p) Se, X p4A*(N)ex 
i=0 
= pe, X {pA(N)}*e1 
i2=0 
= pe,{I — pA(N)}~e1. 
Hence, 


I 


ey (i + DpiAt(N)er 


+20 


; a d 
z'(p) = a, 


e, & i{pA(N)} eq 


#21 


1 
mae fer 


Tv 


Returning to 2(p) and (55), observe that 


pola 
2(e) = ————__-_ 
[I — pA()| 
CNH = pNqN, 
Hence, 
pH 
2(p:) = ——___— 
[I — pA(N)| 


and, from (56), 


d pNtlq N+ 
reo br 
dp ||I — pA(N)| 
Let 
1 A pNtly_N+1 


vy | =pA(N)| 


25 


(55) 


(56) 


(57) 


(45) 


(58) 


(59) 


Since vy = (1/p7)yn where yw has been defined previously in (48) and 
the recursion in (49) for yy is linear, vy satisfies the same recursion. 
Hence, with uy = (d/dp)un(p), the following formula is obtained: 


UN 

Hy = —-— 
2 

UN 


(60) 
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and, 
1 l—-—qd 
vN = — UN-1 — UN—k-1 (61) 
pt T 
1 l-—@f 1 
UN = — UN-1 — 





UN—k-1 — ——UN-1. 
pr 


(62) 


pw w 


VII. ASYMPTOTICS OF RECURSIONS 


The main recursions occurring in the paper are of the following 
forms: 








1] 1l-—q-r 

tn = —XnN-1 — ®N—k-1 (63) 
w v 
1 1-7 1 

YN = — YN-1 — YN-k-1 +- (64) 
T T 


1 l-— qr 
en >= — én-1 — 
pw 





2N—k-1 whereO <p<i1. (65) 


Equation (63) occurs in the formula for the (unnormalized) steady- 
state probabilities and in the formula for the normalization constant ; 
(64) occurs in the formula for the mean first passage time ; (65) occurs in 
the formula for the probability of no overflow during bursts. The 
fundamental solutions of these recursions are obtained from the roots 
of the following polynomials. 

l-@7 





a alee ee (66) 


Tv 


l-—q 





1 
Gu) = wth — — ph + (67) 
pt 


Tv 

Equation (66) is associated with (63) and (64); (67) with (65). The 
two results given below enumerate and estimate the roots of F(u) and 
G(u) outside the unit circle. 


Lemma 1': Except for one positive real root 1/0, and 1, all other roots of 
F(u) lie inside the unit circle |u| S 1. The root 1/6 lies outside the unit 
circle tf and only if k > r/(1 — 7). 

Lemma 1 is a specialization of a result proved in Ref. 1. Bounds on 


6 are also given there. 


Lemma 2: G(u) has k roots inside the unit circle |u| S< 1, no roots in the 
annular ring 1 S |p| S1/p, and one real, positive root outside the 
circle |u| S 1/p. 
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Proof: 
1 poe 
C0) So S05 
Tv 


G(1) ee 2h (p< ® 


eae) = —*[1-—] <o 


k 
peti 





Since G(O) > 0 and G(1) <0, there exists a real positive root of 
G(u), r, where r < 1. Since G(1/p) < 0 and G(u) > © as p> ~, 
there exists a real positive root of G(u), R, where R > 1/p. The 
following theorem which is stated without proof may now be applied. 


Pellet’s Theorem:’° Given the polynomial 
f(@) = do + aye + +++ + ap2? +--+ + nz", ap XO. 
If the polynomial 
F(z) = |ao| + larle +--+ + lapale?t. 
— |a@plz? + [apy1[2?** + +++ [an l2” 


has two positive zeros rand R, r < R, then f(z) has exactly p zeros in or 
on the circle |z| < rand no zeros in the annular ring r < |z| < R. 


Identifying p with k, n with k + 1 and f(z) with G(u) the rest of 
the proof follows. 
The reader may now verify that, for large N, 


tunic, if k< 





l-—@7r 


~C,+ C.N if k=nr/l—-—a7 





] N 


l-—7 





yn = C1 + NC, if k< 


l-—@wr 


~O,+NC,+NC, if k=a/l—or 





l N 
= Cr+ NOs + 0,(;) if k> 


l-—@7r 


Zn => Ci(R)* 
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where, # and 1/6 are roots previously defined and the C’s are constants. 
The constants may be obtained by fairly straightforward computations. 

The qualitative difference between the forms of the expressions 
corresponding to k < r/(1 — 7) and k > x/(1 — 7) are noteworthy. 
This is not unexpected, since it may be recalled that in Ref. 1 it was 
proved in a more general context that the Markov chain associated 
with the infinite buffer is positive recurrent if and only if k > 7/(1—7). 


VIII. A MONOTONICITY PROPERTY OF THE PROBABILITY OF A FINITE 
BUFFER BEING FULL 


The steady-state probability of a buffer being full, i.e., P(N) where NV 
is the size of the buffer [see Section III and, in particular, eq. (21) ] 
may be expected to be an important factor in the practical design of 
buffers. This is so not only because of the immediate implications of 
the definition but also because 1/P(N) is the average recurrence time 
of state N. However, this approach would appear to overlook the 
possibility that the probability of the buffer being full in the transient, 
i.e., in the approach to steady state, is seriously underestimated by 
P(N). Such an event is not easy to rule out because, after all, P(N) is 
an element of only one (normalized) eigenvector of the transition 
matrix while all the modes or eigenvectors and eigenvalues of the 
matrix contribute to yield P,(N) when m is finite. However, one of the 
implications of the result in this section is that, under certain conditions 
on the initial probability distribution of the contents of the buffer, 
P(N) is indeed an upper bound on P»(N), ie., Pm(N) S P(N), 
m =0, 1, --- ; furthermore, the important case of the buffer being 
initially empty satisfies the conditions just mentioned. 

For a buffer of size N, the result states the following. Suppose at the 
mth instant the state probabilities satisfy the inequalities: 

itk 
mPr(t) —(1—7) 2 Pa(j)20 1=0,1,---,N—k (68) 


jeitl 


Pmt) — (1 —7) 2 Pn(j) 2 0 


j=i+l1 
i=N—-kK+1,N—kK4+2,---,N—-1. (69) 


Then (a) Pm(N) S Pmsi(N), and, as shown below, (b) the inequalities 
in (68) and (69) are satisfied with P,,(l) replaced by Pmii(l) for l = 0, 
1, 2, --- , N. Therefore, if (68) and (69) hold, P:(N) S Pisi(N) for 
all 1, 7 2 m;1.e., the probability of the buffer being full is a monotonic, 
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non-decreasing sequence. (a) may be trivially verified. The proof of 
(b) is as follows. 


(i) i= 0. 


TP myi(t) —  — 4) {Pmgilé + 1) + Part + 2)0++ + Pmaili + &)} 
=ar(1 — r)[Pa(0)+ Pal) + --> + Palk)] — 1 —a)r 
<X [Pn(0) + Pal) +--+: Palk — 1) -— (1 — rPLPa(k + 2 
= (1 — m)[tPa(k) — (1 — 7){Pnlk + 1) 
+ Pu(k +2) ++ + Pa(2k)}] 2 0 


(ii) 1S i< N — 2k. 


TP mgr(i) — (1 4) {Pmt +1) + Payal + 2) +++ + Piri t )} 
=n(1 — m)[Pn(0) + Pa(l) + +++ + Pn(k)] — (1 — m)rLPn(0) 
+ PraQ1_)+--> + Pa(k -—- 1) - 1 — 7)P*LPak+ 1D 
+ Pn(k +2) + +++ + Pa(2k) 
= (1— w)[ePa(k) — (1 — 2){Pa(b + 1) 
+ Py(k +2) + +++ + Pa(2k)}] = 0 
(wi) N-2k+1s5isN—k-1. 


TP myi(t) — (L — w){Pmyi(t + 1) 
sf wept 2) ee eae he) 
=rlPrati-1)+ 1 —7)Palit kh] - 1-7) 
X tLPm(t) + Poi +1) +--+ + Pai tk —1)] 
—(l—7P[PraG@tk+1)4+ PaGitk+2)+---+P,(N)] 
= a[rPr(@i— 1) — (lL — w){Pn@) + Pali + 1) 
+e) +PaG+tk—D}]J +0 —wMirPaGith —- A-7) 
X {Pat +k+1)4+ Pai tk +2) + --- + Pua(N)}] 20 


(wv) 1= N —k. 


Pp hG == a) (Pi 
+ Pingilt + 2) + 2+ + Paryilt + )} 
= airPp(t — 1) + (l— 7)Pali +k] — (1 — 2)tlP alt) 
+ Pult+1)+e++ + Pat +h —1)j — 1 — w)eP.(N) 
= x[eP,(é — 1) — (1— 2) {Pad 
+ PrGt +--+ +PaGitk—-1j}]=0 
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w~) N-k+1s185N —-1. 


TP mit) — (L = ©) {Pmyil@ + 1) + Pmgilt + 2) +--+ + Piil(N)} 

= alaPr(t—1)—-A— m{ Pa) + Palit 1) 

+--+ +Pa(N)}J 2 0 
(b) is proved. 

Observe that Pm(0) = 1, Pm(t) = 0,72 = 1, 2, --- , N satisfies the 
inequalities (68) and (69). However, the other initial distribution of 
interest, namely, Pm(0) = 0, Pu(1) = 1, Pu(t) = 0,7 = 2,3, ---, N 
does not satisfy the inequalities. Also, it may be verified that for the 
latter set of initial conditions, the monotonicity property does not hold. 

It is interesting to note that if (68) and (69) hold, then P,,(0) 
> Pm4i1(0), so that together with (b), P:(0) = Pii1(0) for all 7,72 = m, 
i.e., the probability of the buffer being empty is a monotonic, non- 
increasing sequence. 


APPENDIX A 


Recursive Formula for |D(N) | 


1 2 k+1 N N+1 
12 lee se Sel. Re ee oe, Gat tas Soe 1 1 
a 1 0 (r—1) 0 2 
DW) = 
-r 1 0 (r—1)|}|N-—-k+1 
—7r 1 0 —k+2 
1 0 N 


Expanding |D(N)| along the (N + 1)st row yields 
|[D(N)| = |DW — 1)| + 7/X| (70) 


where 


FORMULAS ON QUEUES-I 3l 


be 
I> 
| 

q 

o 


N-1 N 
1 1 1 
2 
” gape oe Jaa 
0 0 IN=k 


0 (rx -~1)|N—k+1 


Expanding |X| along last column: 


[Xb (aa) ae Dey |) 1) 
where 
1 2 k+1 N-1 
1 1, 4 1 1 
7 1 0 r—l 2 
y& 
—T -1 O a—-1|N—-—k—-—1 
N—k 


Expanding | Y| along the last (&k — 1) row. 
[VY] = (—x)**|DW — k — 1]. 


32 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1973 


Combining (70) through (72): 
|D(N)| = |D(N — 1)| — 1—7)r*|D(IN-—k—1)|. (78) 


APPENDIX B 

Recursive Formula for |I — pA(N)| 

I — pA(N) is given in (44). Let x, denote |IJ — pA(N)|. Also, 
= —p(l—-7) and p= —pr. 


(i) Expand |I — pA(N)| along the last, ie, (N + 1)th row, of 
LI — pA(N)]. 


ty = ty-1 — p|X| (74) 
where 
1 2 k+1 N 
1+rA AX... 6 Ah CUO 0 1 
Be 1 0 r 2 
A A OJ N—k-1 
x= 0 O|N—k 
0 AIN—k+1 
“php 1 O 
wp OJIN 


(7) Expand |X| along the last, i.e., Nth column of X. 
|X| = (—1) FD +9 | Y| 


= (—1)-A|Y| (75) 
where 
1 2 k+1 
1+.A AXA LA 1 
Be 1 A. 2 
yA 
“php “1 0 A|N—k-1 
xp 1 0 O|N—-—k 
O pn 1 0 Oj N-—-k+1 
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(i271) Expand |Y| along the last k — 1 rows. 


|Y | = p*lyy_y-1. (76) 
Hence, 
LN = Ly_-1 — p|X| from (74) 
= ty-1 — w(—1)*-A|Y | from (75) 
= Xn-1 — w(—1)* "yi ley from (76) 
= XN-1 — pong (l — w)tn-k-1. (77) 
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A Frame-to-Frame Picturephone® Coder 
For Signals Containing Differential 
Quantizing Noise 


By D. J. CONNOR, B. G. HASKELL, and F. W. MOUNTS 
(Manuscript received July 18, 1972) 


The frame-to-frame coder described in Ref. 1 used an 8-bit PCM signal 
for input. If, instead, the signal ts obtained by digitally integrating the 
output of an element difference coder, the quantization noise may be mis- 
interpreted as motion, and cause unnecessary transmission. In the 
particular example of the Phase I coder,? the quantization noise loads the 
frame codec to the extent that rt produces an unacceptable picture. 

In this paper, a frame-to-frame coder for Picturephone® signals is 
described which is capable of coding the digital output of a Phase I codec 
for transmission over a 2-megabit/second channel. Improved methods are 
used to segment the noisy picture into moving areas and background areas. 
The moving areas are then transmitted using a number of data reduction 
techniques. During periods of slow movement, clusters of frame-to-frame 
differences in the moving area are transmitted. For moderate movement, 
frame differences are sent only in every other field, the moving areas of 
intervening fields being transmitted by a conditional field interpolation 
technique. For rapid movement, 2:1 horizontal subsampling ts used, and, 
finally, during violent motion when the buffer fills, frame repeating 1s used. 

The picture quality obtained from a laboratory simulation of this 
system is believed to be satisfactory even for a very active subject. With 
small amounts of motion the subjective quality 1s actually improved because 
the visibility of the quantizing noise from the Phase I codec 1s reduced by 
the inherent frame repeating action of the coder. 


I, INTRODUCTION AND SUMMARY 


In Ref. 1 an 8-bit-per-picture element (pel) Picturephone-type signal 
is coded using only 2 megabits/second (Mb/s). Clusters of significant 
frame differences are transmitted using a double-length code (four-bit 
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and six-bit) for the frame differences and eight-bit addresses for the 
clusters. During periods of moderate movement, every other significant 
frame difference along the line is transmitted, the intervening elements 
being obtained by linear interpolation. If, during violent motion, the 
buffer fills, frame repeating is used. 

In this system a frame difference was deemed significant if its 
magnitude exceeded some threshold value (T' = 4, 5, 6, 7) which 
depended on the buffer fullness. Two exceptions to this criterion were 
made, however: (2) if a significant change was surrounded on both 
sides by two insignificant changes, then the change was deemed 
insignificant, and (22) if two clusters of significant changes were 
separated by three or less insignificant changes, then the clusters were 
joined by relabeling the intervening changes as significant. 

For maximum flexibility of the Pzcturephone transmission system, it 
is desirable that an interframe coder be able to accept as an input a 
signal that has previously been coded by an intraframe coder, such as 
an element difference coder. Such a signal will have a significantly 
higher level of quantization noise than an 8-bit PCM signal. The 
Phase I codec? is an example of an element difference coder. Since the 
quantization noise from this coder has been carefully shaped for 
minimum visibility, the signal it produces probably contains the highest 
noise level of any signal likely to be encountered by an interframe 
coder. Designing an interframe coder to work with such a signal thus 
reveals many of the problems involved in working with signals having 
realistic noise levels. 

If the input signal contains element differential quantizing noise, 
the system in Ref. 1 does not perform well at all. An inordinate 
number of sizable frame-to-frame differences arise due to the quantizing 
noise, and in the case of the Phase I codec, acceptable video transmis- 
sion at 2Mb/s is impossible. Raising the threshold of significance 
reduces the number of background frame differences which are trans- 
mitted, but it also reduces the number of subjectively important frame 
differences in the moving area which are sent. Unacceptable picture 
quality results. 

Averaged over a small region in space and time, the frame differences 
due to quantizing noise differ in many ways from the subjectively 
important frame differences due to movement. For example, frame 
differences due to movement are correlated spatially, whereas frame 
differences due to quantizing noise are not. 

These properties have been exploited to give a method for segment- 
ing the picture into moving areas and stationary areas.? The moving 
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area as defined by the segmenter tends to be slightly larger than the 
actual moving area, but it has been found that this is necessary if a 
subjectively acceptable picture is to be obtained. 

The number of picture elements which must be transmitted using the 
noisy input and this segmenter is much larger than with the 8-bit input 
and the segmenting criterion of Ref. 1. Thus, even with a good seg- 
menter, the data rate is larger than 2 Mb/s using only the data reduc- 
tion techniques of Ref. 1. Other means of data compression are required 
if a 2-Mb/s rate is to be obtained. 

Two techniques are proposed. First, since the segmenting criterion 
used here requires that all picture elements in the moving area be 
transmitted, a large number of zero frame differences are sent, 1.e., 
the average transmitted frame difference is much smaller than in 
Ref. 1. Under these circumstances, variable word length codes can be 
used to good advantage. Using a variable word length code optimized 
for moderate motion, only about two bits per frame difference are 
required on the average. Using this same code during periods of active 
motion requires about three bits per frame difference on the average. 

Using the new segmenter and variable word length coding of frame 
differences, transmission below 2 Mb/s is easily accomplished during 
periods of slow movement. When motion becomes a little more rapid, 
however, the 2-Mb/s rate is surpassed, and another data compression 
technique must be used. Two-to-one horizontal subsampling generally 
results in subjectively unacceptable picture quality because the 
movement is too slow to hide the resolution loss. Thus, a conditional 
field interpolation technique*:* is used as the second method of data 
rate reduction. 

With this technique, frame differences in the moving area are 
transmitted only during every other field. Each pel in the moving area 
of the intervening fields is obtained at the receiver by a four-way 
average of vertically adjacent picture elements in the two fields 
adjacent to the one being coded. However, if the four-way average is in 
error by an amount larger than some prescribed threshold, then a 
quantized correction value must be sent to maintain acceptable 
picture quality.‘ 

The receiver as described above would still have to be told 
which picture elements in the intervening field are in the moving 
area, and which are in the background. However, since movement is 
so highly correlated from field to field, we believe that this information 
can be extracted from the two fields adjacent to the one being 
interpolated. 
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With rapid motion, 2:1 horizontal subsampling can be employed. 
This is brought in under buffer control. When motion becomes violent 
and the buffer fills, then transmission ceases for one frame period and 
the previous frame is repeated. 

Using the data compression techniques described above, a laboratory 
simulation was constructed to test the important aspects of a 2-Mb/s 
frame-to-frame codec that is capable of coding the digital output of a 
Phase I codec. A simplified block diagram of the simulation is shown 
in Fig. 1. A digital signal identical to the output of the frame-to-frame 
codec was passed through another digital Phase I codec without degrad- 
ing the picture noticeably. The system described is capable of accom- 
modating about the same amount of movement as that in Ref. 1, 
with a picture quality comparable to that of the Phase I codec. 

The Phase I codec was designed, of course, without any thought of 
frame-to-frame coding. It is not surprising, therefore, that many 
difficulties arise when frame-to-frame coding techniques are applied 
to the output of a Phase I codec. Changes in the Phase I coder to reduce 
the quantization noise would not only result in a simpler interframe 
coder, but could also lead to a data rate less than the 2 Mb/s obtained 
here. How much less will have to await further study. 

The next four sections describe in more detail the operation of the 
frame-to-frame coder. The last section describes the simulation. 


II. SEGMENTING THE PICTURE INTO ‘‘MOVING”’ AND “STATIONARY’’ AREAS 


An essential preliminary to the development of the coder described 
in this paper was the development of methods for detecting or segment- 
ing the moving area in a video signal which has already been corrupted 
by noise due to an in-frame coding operation. A full description of the 
work done on this problem will be given in subsequent papers. In this 
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Fig. 1—A simplified block diagram of the simulation showing the signals used 
and produced by the segmenter. 
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section, we will simply state the various properties of the video signal 
and the coding noise which can be exploited in detecting the moving 
area. Following that, we give a description of the actual segmenter 
that was developed for use in the system described in this paper. 

In order to separate the frame-to-frame brightness changes caused by 
movement from those caused by noise from an element difference 
quantizer, advantage can be taken of certain distinguishing properties. 
The most important property of the movement-generated frame 
differences is that they are spatially correlated. Two properties of the 
noise are important: 


(¢) It is almost entirely uncorrelated spatially ; 

(ii) The magnitudes of individual noise spikes are equal to the 
spacing of the representative levels used in the element differ- 
ence quantizer. 


The second property of the noise results from the fact that in stationary 
areas a small noise perturbation from one frame to the next can cause 
a change in the representative level used to encode a particular element 
difference. This change will be to an adjacent representative level in 
the quantizing scale, and, consequently, the resultant frame difference 
will be equal to the spacing of those levels. The more widely spaced 
outer levels of the companded quantizing scale are used to encode 
detailed areas and contrasty vertical edges. Thus, the frame difference 
noise is greatest in these regions. 

Finally, a useful property of moving areas is that they are spatially 
and temporally contiguous. In other words, if a pel is in the moving 
area, it is highly probable that the spatially adjacent pels and the same 
pel in the next frame are in the moving area. 

The signals employed by the segmenter in detecting the moving 
area are indicated in Fig. 1. A block diagram of the processing of the 
quantized element difference signal and the frame difference signal is 
given in Fig. 2. The frame difference signal undergoes two separate 
spatial filtering operations which increase the signal-to-noise ratio for 
the spatially correlated frame differences caused by movement. 
Filter A is designed to enhance the frame difference signal associated 
with moving edges and particularly with vertical edges moving 
horizontally. This signal is characterized by high horizontal spatial 
frequencies and lower vertical spatial frequencies. By averaging the 
frame difference signal from adjacent lines, these low vertical fre- 
quencies are enhanced relative to the spectrally flat frame difference 
noise. 
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Fig. 2—A simplified block diagram of the segmenter showing the spatial filtering 
and noise estimation processes. 


Filter B is designed to enhance the frame difference signal associated 
with the movement of relatively flat areas. This signal has most of its 
energy at low spatial frequencies. By averaging the frame difference 
signal in an 8-pel-by-2-line area, an increased S/N ratio is obtained. 
After the averaging operation, the signals from both filters are rectified 
since the frame difference signal can be of either sign. 

Although filter B enhances the movement-generated frame differ- 
ences in relatively flat areas, it is found that in highly detailed, 
stationary areas its output commonly exceeds the output arising in 
slowly moving, flat areas, such as hair. Thus, simple threshold detection 
is no good. However, it is possible to compensate the output of filter B 
for these detail-dependent variations in the frame difference noise by 
subtracting a filtered estimate of the magnitude of the noise signal. 

As mentioned above, individual frame differences caused by quantiz- 
ing noise are equal to the spacing between representative levels of the 
element-difference quantizing scale. Thus, in blocks C and D in Fig. 2, 
the filtered estimate of the noise signal is derived from the quantized 
element-difference signal by generating at the output of block C a 
non-negative signal that is proportional to the spacing between the 
input representative level and the adjacent smaller level in the element- 
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difference quantizing scale. (Because the probability distribution of 
element differences is monotonic and peaks at zero, the most probable 
transition between representative levels due to a noise perturbation is 
from an outer level to the adjacent smaller level.) The estimated 
frame-difference, noise-magnitude assignment for the representative 
levels is modified for the four inner levels of the 16-level quantizing 
scale of the Phase I codec as shown in Table I, which gives the output 
versus input for block C. This modification reflects the fact that noise 
frame-differences are relatively small in flat areas of the picture. 
Experimentally it was found that flat, stationary areas could be more 
easily distinguished from flat, moving areas if no noise compensation 
was used in these regions. Thus, the estimated frame-difference 
noise magnitude for the four inner levels is set to zero. 

The filtered and noise-compensated frame-difference signals serve 
as inputs to the decision logic of block E. This logic takes advantage of 
the fact that moving areas tend to be contiguous both spatially and 
temporally. Thus, if movement is occurring at a particular pel, there 
is a high probability that movement is occurring at pels that are 
spatially and temporally adjacent. Consequently, the philosophy for 
the design of the decision logic was to use a high decision threshold for 
the detection of movement in regions of the picture which were 
previously stationary, and a lower threshold in regions where move- 
ment had recently been detected. 

A block diagram of the decision logic is given in Fig. 3. (For simplic- 
ity, a number of delays required to keep the binary signals in register 
have not been shown.) The filtered and noise-compensated frame- 
difference signals serve as inputs to this logic. They are first converted 
to binary signals by threshold operations having the following transfer 
characteristics, 

B; 


B; 


1 if FET; 
0 if F<T; 


I 


where F is the input, 7’; the threshold, and B; the corresponding binary 
output signal. A control signal from the interframe coder that indicates 
the amount of movement by measuring the buffer fullness is used to 
raise the thresholds 7; and 73; during periods of fast motion.! Move- 
ment detection is easy in this situation, and the segmenting accuracy 
can be increased. 

In order to best describe the operation of the decision logic, we will 
start with the block labeled ‘‘Binary Threshold Logic with Hysteresis.”’ 
This block will be referred to as an N out of MW (N/M) device after 
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Limb and Pease.® A block diagram of this device is given in Fig. 4. 
The accumulator in the N/M device keeps a count of the number of 
ones in the 8-by-3 block of 24 pels adjacent to the pel of interest as 
shown in Fig. 5. (Thus, M is 24.) If the output of the accumulator is 
greater than or equal to the threshold N; = 9, the output flip-flop is 
set; and segmenter output function Bs; becomes a one to indicate 
moving area. In keeping with the design philosophy mentioned above, 
the flip-flop can only be reset by having the output of the accumulator 
drop below the lower threshold N. = 4. Note that by setting N; equal 
to nine, the signal Bs, which indicates the occurrence of flat area 
movement on the present line can never by itself cause the flip-flop to 
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Fig. 3—Decision logic. The N/M device processes binary signals from the present 
and previous fields to produce the moving area signal. 


FRAME-TO-FRAME CODER 43 





THRESHOLD 






















ACCUMULATOR 
TO MONITOR 3 
NUMBER OF SET =| Ftp-|__5 
BINARY ONES RESET |FLOP 


IN 8x3 BLOCK 
AROUND POINT 
OF INTEREST 










THRESHOLD 









| No =4 | 
B,—MOVING EDGES heat 4 RORSME 
B,—-FLAT AREAS MOVEMENT Bg—-MOVING AREA 
B, AND Bg—PREVIOUS FIELD ( ENON SIGNAL 
MOVEMENT OUTPUT = i 
LIF. N> N; 


Fig. 4—N/M device. Two thresholds are applied to the output of an accumulator 
that keeps a count of the number of binary ones in an 8-pel-by-3-line area around the 
point of interest. These thresholds control the state of the output flip-flop along with 
the moving edge signal B,. 


be set. Initially, the only way the flip-flop can be set is for a one to 
occur in the signal B,. Since this function indicates movement of edges, 
edge movement must be detected before flat area movement. However, 
once edge movement is detected, the flip-flop is set and the lower 
threshold N, determines whether adjacent pels on the same line will 
be designated as moving. In addition, referring to Fig. 3, if Ba, which is 
a more sensitive but noisier indicator of flat area movement than Bs, 
is a one when the flip-flop is set, Bs will be a one. Hence, in keeping 
with the design philosophy, the value of Ni for the spatially and 
temporally adjacent pels in the next field will be effectively lowered by 
the appearance of these ones in B; and Bs. As a result of the interactions 
described above, the N/M device tends to fill in moving areas, and to 
designate areas as moving for a short while after they become 
stationary. 

Given the above description of the N/M device, the functions and 
choice of design variables for the various other blocks in Fig. 3 become 
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Fig. 5—Arrangement of the 8-by-3 block of pels monitored by the N/M device. 
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evident. The threshold 7; is set relatively high (~10 on an 8-bit PCM 
scale of 256 levels) to insure that a binary one in the function B, is 
indeed caused by the movement of an edge. This function undergoes 
further processing so that isolated ones (no other ones within two pels 
horizontally in either direction) arising from noise spikes are set to 
zero.' Similarly, the threshold 7’; is set relatively high (~4/256) to 
insure that the condition B; = 1 corresponds to movement in flat areas. 
The threshold 72 on the other hand can be set lower (~2/256), since 
it causes ones to occur in B, only if the segmenter output, Bs, is a one. 
However, it eliminates from Bs most of the ‘‘fill-in’” pels generated by 
the N/M device. This process stabilizes the feedback loop around this 
device. 

If the thresholds 71 to 73 are fixed, they must be set quite low in 
order to detect very slow motion. Given the level of quantization noise 
from a Phase I coder, such low thresholds inevitably lead to the 
inclusion of some background points in the moving area. By using the 
control signal from the buffer, the thresholds can be made speed 
dependent. For even moderate motion, the segmenting is then virtually 
ideal. 


IiI. VARIABLE WORD-LENGTH CODING OF FRAME DIFFERENCES 


In Ref. 1, the 9-bit frame differences (— 255 --- 0 --- +255) were 
quantized into 64 levels. Since the Phase I coder gives an effective 
6-bit signal (6 bits with the seventh bit alternately 0 and 1 along the 
line), only frame differences that are multiples of 4/256 can occur. 
This set of frame differences is sufficiently coarsely quantized for 
efficient transmission. 

Also, in Ref. 1 it was very much easier to separate the subjectively 
important frame differences from those few due to camera and system 
noise. In the system described here, where a Phase I signal is used as 
an input, once the moving area has been identified, all frame differences 
in it must be transmitted since it is not possible to tell which are due to 
movement and which are due to quantizing noise. Within the moving 
area, as defined by the segmenter, many zero frame differences do occur. 
However, since they are randomly interspersed among the nonzero 
frame differences, it is much more efficient to transmit them than it 
would be to delete them and address the remaining nonzero frame 
differences. 

This causes the average magnitude of transmitted frame differences 
to be considerably smaller than in Ref. 1 where an 8-bit input is used. 
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Fig. 6—Typical histogram of moving area frame differences during moderate 
motion. Huffman code word lengths are shown for each level. 


Thus, a more complex variable word-length code can be used to good 
advantage in reducing the average number of bits required to transmit 
a frame difference. Preliminary measurements indicate that with a 
good variable word-length code, less than two bits per frame difference 
are required on the average during periods of slow movement. During 
moderate movement, a little more than two bits per frame difference 
are required ; and during rapid movement, about three bits are needed. 

Figure 6 shows a typical histogram of the magnitude of the frame 
differences in the moving area during moderate motion. Also shown are 
the Huffman code word lengths corresponding to this distribution. 
The average word length per frame difference is 2.05 bits. 


IV. CONDITIONAL FIELD INTERPOLATION 


During very low-speed movement, variable word-length coding of 
frame differences in the moving area is sufficient to code at a rate 
below 2 Mb/s. Unfortunately, the speed at which the bit rate rises 
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Fig. 7.—Four-way vertical averaging. Fields 1 and 3 are sent via frame differences 
in the moving area. Information about moving area pels (E) in field 2 is sent only if 
the interpolation error |E—(A+B+C+D)/4| exceeds a threshold. 


above 2 Mb/s is still too slow to hide the resolution loss incurred by 
2:1 horizontal subsampling. Thus, another data compression technique 
is used. 

With conditional field interpolation (called conditional vertical 
subsampling in Ref. 4) only every other field is transmitted by sending 
frame differences in the moving area. The moving area pels in the 
intervening fields are obtained from a 4-way average of vertically 
adjacent pels in the two adjacent fields. In Fig. 7, fields 1 and 3 
have been transmitted via frame differences in the moving area, and 
pel E is to be sent via conditional field interpolation. Pels A and C are 
directly above E, and pels B and D are directly below E. The 4-way 
average (A + B + C+ D)/4 is computed and used as a prediction 
of E. If the interpolation error does not exceed some prescribed 
threshold value, then nothing is sent, and the 4-way average is used 
in place of E. If the interpolation error does exceed the threshold, then 
a quantized correction value is transmitted. 

Since the receiver treats background area in the interpolated fields 
differently than it does moving area, it must be told which picture 
elements are in the moving area and vice-versa. Preliminary measure- 
ments indicate that addresses for the moving area of the interpolated 
fields could probably be transmitted using less than 0.1 Mb/s. Alter- 
natively, the moving area of the interpolated fields might be satisfacto- 
rily obtained from the union of the moving areas in the two adjacent 
uninterpolated fields. This would not require any additional informa- 
tion to be transmitted. 
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In order to determine whether or not the field interpolation error 
was acceptable, threshold values between 7 and 15 out of 255 were used. 
These values gave acceptable to marginally acceptable picture quality, 
and a data rate which was drastically reduced compared with sending 
frame differences. 


V. BLOCK DIAGRAM 


Figure 8 shows a block diagram of the system. (The segmenting 
operation is shown in detail in Figs. 2 to 4.) During very slow movement, 
every field is transmitted by sending frame differences (B’ — D) in 
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Fig. 8.—Frame-to-frame coder for Picturephone® signals with Phase I quantizing 
noise. During field interpolation, information from two fields is fed to the buffer 
simultaneously. 
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the moving area as defined by the segmenter. 8; is in the 0 position to 
give an uninterrupted frame memory, and §; is in the 0 position so 
that no interpolation error information reaches the buffer. 8: is 
controlled by the segmenter. When in the 0 position, the previous frame 
value D (see Fig. 7) is fed to delay I, and no frame difference is fed to 
the buffer. When in the 1 position, the new pel B’ = D + (B’ — D) 
is fed to the delay, and a frame difference is fed to the buffer for coding, 
addressing, and transmission. 

When movement becomes more rapid and the buffer fills beyond 
some prescribed threshold, only every other field is sent via frame 
differences in the moving area as outlined above. Mode switching occurs 
only at the end of a field. During input of a field which is to be inter- 
polated, S, and §; are in the 0 positions allowing uninterpolated fields 
to enter delay II unchanged. 8; is controlled by the segmenter as 
before ; however, no frame differences are fed to the buffer for transmission. 
Coding and transmission of this field takes place at a later time. Thus, 
during input of interpolated fields no amplitude information is fed to 
the buffer. Addressing information needed to specify the moving area 
at the receiver could be sent at this time if it is found to be more 
efficient ; however, this information could just as well be obtained from 
the output of delay III and sent later during the actual coding and 
transmission of the interpolated fields. 

During input of uninterpolated fields, coding and transmission of 
frame differences in the moving area are carried out as usual by means 
of switch S;. However, at the same time, coding and transmission of 
interpolated fields are also performed. When pel E in an interpolated 
field (see Fig. 7) emerges from delay I, pels A, B, C, and D are emerging 
from their respective delays as shown in Fig. 8. The output of delay 
III identifies E as either a background or a moving area pel. 

If E is a background pel, S2 and §; are switched to the 0 positions. E 
enters delay II and no information is fed to the buffer. If E is a moving 
area pel, then 8; is switched to position 1, and 8: is controlled by the 
threshold logic T. The threshold logic compares the magnitude of the 
interpolation error [E — (A +B+C+D)/4] with a prescribed 
threshold. If the error is smaller than the threshold value T, then S, is 
opened (0 position), nothing is fed to the buffer for transmission, and 
the 4-way average enters delay II in place of E. If the interpolation 
error is too large, Se is closed (1 position), a quantized interpolation 
error generated by the quantizer Q is fed to the buffer for transmission, 
and the corrected interpolation value is fed to delay II in place of E. 

A number of implementation aspects have not been discussed. 
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Fig. 9—Receiver configuration. Information is read from the buffer two fields at 
a time during field interpolation. 


For example: 


(¢) All moving area picture element information fed to the buffer 
must, of course, be accompanied by addressing information, 
and efficient addressing may require that some of the switch 
control functions be modified, e.g., isolated point rejection, gap 
bridging (see Ref. 1). 

(zt) During field interpolation, information from two fields is fed 
to the buffer simultaneously. Thus, some multiplexing arrange- 
ment must be devised in order to implement the system as 
described. For example, a buffer might be provided for each 
field and the outputs switched. 

(227) The receiver configuration is very similar to that of the 
transmitter (see Fig. 9). 

(zv) Two-to-one horizontal subsampling, and frame repeating have 
not been discussed here since they are covered elsewhere.!'6 


VI. SIMULATION OF THE SYSTEM 


A number of short cuts were taken to simulate the system described 
above. First, no coding, buffering or transmission of the data was 
undertaken. In the simulation, only the picture processing performed 
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at the transmitter was undertaken. The picture which would have 
appeared at a receiver in the absence of transmission errors was 
equivalent to the output of field memory II in Fig. 8. As in Ref. 1, 
buffer control of the picture processing was obtained by using an 
analog integrator to keep track of the number of bits that would have 
been in a real buffer had one been built. Also, as in Ref. 1, a buffer size 
of 67,000 bits was chosen so that it would completely empty if the 
input of data were stopped for one frame period. 

Second, the effect of the variable word-length coding was only 
partially simulated. Recall from Section III that with a good variable 
word-length code, pels in the moving area could be transmitted using, 
on the average, less than two bits per frame difference during periods of 
slow movement, approximately two bits during moderate movement, 
and about three bits during rapid movement. This was simulated by 
counting two bits per frame difference during periods of slow and 
moderate movement and four bits during rapid movement when 2:1 
horizontal subsampling was employed. 

During conditional field interpolation, the same bit assignment 
scheme was used to account for the transmission of interpolation 
errors. Although transmitted interpolation errors were not quantized 
in the simulation, preliminary results indicate that they can be 
quantized quite coarsely. Thus, a 2-bit, 4-bit assignment is not 
unreasonable. 

Transmission of moving area addresses for the interpolated fields 
was not simulated. Preliminary measurements indicate that with rapid 
motion, the number of clusters requiring addressing is, on the average, 
about two per line. If 16 bits are used to address each cluster, then 
about 0.1 Mb/s would be required to transmit them. If, as was con- 
jectured in Section IV, this moving area can be obtained adequately 
from the uninterpolated fields, then no extra information need be 
transmitted. 

Finally, transmitted information from interpolated fields was delayed 
by a field period before being fed to the buffer simulator purely for 
reasons of expedience. This means that during most of conditional 
field interpolation, information from two fields does not enter the 
buffer simulator at the same time as is described in Section V. This 
should not affect the results very much since much less data is gen- 
erated during interpolated fields than during uninterpolated fields. 
However, frame repeating due to buffer filling may occur slightly 
more often in the actual system than it did in the simulation if the same 
buffer size is used. 
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The acceptability of the pictures obtained using the simulation 
described above was determined mainly by comparison with pictures 
from the Phase I codec alone. This codec gives pictures that have 
moderate amounts of both granular noise and edge busyness through- 
out the picture. The frame-to-frame codec described above transmits 
information only about the moving area. Consequently, the Phase I 
codec noise in the background becomes stationary and, hence, much 
less noticeable. In this sense, the pictures are improved. 

Some loss of quality is caused, however, by the use of subsampling. 
Under some conditions, a slight jerkiness in the movement being 
depicted is noticeable as the codec enters the vertical subsampling 
mode. Also, for very high-speed movement, a slight checkered pattern 
at contrasty edges is detectable. This is caused by the use of both 
horizontal and vertical subsampling. 

On an overall basis, the picture quality produced by this 2-Mb/s 
codec is felt to be equal to the quality of the input Phase I codec 
signal. 
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A DC-to-2.3-GHz Amplifier Using an 
“Embedding” Scheme 


By GERARD WHITE and GEN M. CHIN 
(Manuscript received July 26, 1972) 


A novel circuit technique ts described for embedding a high-frequency 
amplifier in a low-frequency circuit to achieve a defined, flat gain from dc to 
the cutoff frequency of the hf amplifier. The technique provides this low- 
frequency gain without compromising the hf design optimization. An 
embodiment of this technique is described which has provided, experi- 
mentally, amplifier gain from dc to a half-power point of 2.3 GHz. 


I. INTRODUCTION 


This paper describes a novel circuit technique for providing broad- 
band amplifier gain response from de to extremely high frequencies. 
The technique provides this wide spectrum response without com- 
promising either the hf response or the de stability. This performance is 
obtained by “embedding” a parameter-optimized hf amplifier within a 
de gain-defining circuit, and providing means for ensuring a smooth 
transition between the low-frequency to high-frequency operating 
modes. This technique avoids the compromise of hf performance which 
is frequently present in direct coupled amplifiers.' 

Amplifiers with response to de are frequently required in communica- 
tion systems employing a baseband Pulse Code Modulation (PCM) 
type encoding scheme where the entropy of the information signal is 
unknown. Because of the simplicity afforded by binary PCM, its use 
has been adopted in many optical systems.’ In such cases, the channel 
information rate is restricted mainly by the bandwidth of the electronic 
driving circuits. The economics of noncoherent optical systems again 
dictate the use of extremely broadband amplifiers and, if the system 
simplicity is not to suffer,> a gain response to de is required. Such 
systems should benefit from an hf optimized amplifier providing gain 
to de. A particular realization of the embedding technique is described 
in this paper which may find use in such systems. This realization 
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provides essentially flat gain response up to approximately 0.5 of the 
constituent device common base cutoff frequencies. The technique 
results in an amplifier voltage gain of 8 dB over the range de to 2.3 
GHz with a step response rise time of 200 ps. At frequencies below the 
pre-cutoff resonance, inband ripple is typically less than 1.5 dB. 


II. THE EMBEDDING TECHNIQUE 


Amplifier embedding is applicable to a number of circuit realizations 
but is best described in terms of the simple common emitter stage 
shown in Fig. 1. At de, this stage exhibits a gain of 


Gac > 


Ru + Rm 


and at high frequencies the gain is simply Gur = Ro,/Rz,, so that, 
by making the equality 


Ra + Re 
Rm+ Rm 


= Ra/Rr, 


HIGH FREQUENCY 
AMPLIFIER 


LOW FREQUENCY GAIN 
DEFINING CIRCUIT 





Fig. 1—Embedded common emitter stage. 
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Fig. 2—Amplifier gain spectrum. 
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the two portions of the gain spectrum, shown in Fig. 2, will be equal. 
In addition to this requirement, a flat transitional crossover is required. 
This critical crossover point has always been the major difficulty in 
split-band additive amplifiers.© To analyze the requirements for a 
flat crossover, it is more meaningful to consider the practical circuit 
arrangement shown in Fig. 3. The circuit also incorporates emitter- 
follower buffering stages at the input and output. The additional 


—HIGH FREQUENCY 


Fig. 3—Practical hf amplifier embedded in a low-frequency gain-defining circuit. 
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capacitances are for minimization of the lengths of the constituent 
current loops in the circuit for good high-frequency gain performance 
and the improvement of stability by effecting a reduction in the points 
of interaction of these loops. The condition for a flat gain spectrum 
can be analyzed using the simple equivalent circuit shown in Fig. 4. 
The simplicity of this circuit is afforded by the frequency of crossover 
(approximately 83kHz) being much less than the cutoff frequency of the 
transistor so that a low-frequency model is permissible. 
The transfer function of this circuit is readily shown to be 


Vo Rall + ste 
Vin ~ Rn F Ir | 
F Rui Bed 2878) 1D = eC pha sen eC Ra] 
, Lie ey Deed nea 
where 


D = (1 + sre) (1 + 87z) + SRoCe(1 + stz) + SRmCp(1 + sre) 





Cohe. “CeCe 46s 


I 


TC 


TE= CoRm, Ce =C,4+ Cr 


for 
Rn Rn + Rr 


Ra Rap coo, 


It is clearly seen that by making rz = 7¢, the poles and zeros of the 
system cancel, thus producing a flat crossover. The value of Cg is 
seen to be unimportant. Large values of Cg will produce a dominant 
pole and zero which will tend to mask the required equality of re 
and TE. 


and al. 





Fig. 4—Equivalent circuit of embedded gain cell. 
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Re 





Fig. 5—Embedding technique applied to (a) modified cascode stage, (b) shunt 
feedback stage. (Gain equality requirement indicated for lf and hf cases.) 


The consequence of separating the If and hf gain-determining 
components is that the lf portion may be designed to ensure a high 
de stability (e.g., Rz, large) without impairment of the hf optimization. 

The technique has more general applications to any gain stage where 
the stage gain is defined by the ratio of two real impedances; these 
further applications of embedding are shown for the modified cascode 
and shunt feedback stages of Figs. 5a and b. The realization of Fig. 5a 
is particularly important since the optimization of the high-frequency 
circuit parameters leads to frequencies of operation where the elimina- 
tion of the Miller effect afforded by this circuit is important. Also, 
the low external emitter impedances presented provide an enhanced 
stability, important at these high frequencies. 

These circuits constitute a class of split-band additive amplifiers. 
Split-band amplifiers were first described by Wheeler’ many years ago, 
but their successful realization has been retarded by the difficulties of 
achieving a satisfactory crossover mode in extremely high-frequency 
amplifiers. The embedding scheme provides a crossover transition at 
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relatively low frequencies so that its physical synthesis is straight- 
forward. Although the internal gain is provided in a product sense, the 
embedding technique is essentially additive, and thus related to the 
distributed amplifier of Percival® and the Gilbert® gain cell. 


III. A DC-TO-2.3-GHZ EMBEDDED AMPLIFIER 


The embedding principle has been embodied in a broadband amplifier 
of the type shown in Fig. 3. This amplifier has been designed for 
optimum hf performance. To demonstrate the way in which this 
optimization is achieved without compromising the low-frequency 
stability, it is pertinent to indicate briefly the hf design optimization. 
The design procedure is based on the simplifying assumptions that the 
major frequency-limiting mechanisms are the frequency fall-off of the 
transistor current gain a (modeled as a single pole fall-off), and the 
collector-base capacitance of the gain transistor (this being assumed 
to be the dominant parasitic reactance). For simplicity, both effects 
are evaluated separately. The consequence of the former effect is 
evaluated by substituting the single-pole approximation for @ in the 
gain transfer function for the stage, assuming negligible loading at the 
output of the gain transistor (this latter approximation is justifiable 
since the practical realization employs double emitter follower buffer- 
ing to the output). The gain transfer function is 


Vo r ake, | Rz,/(1 — a)? | 
Vi Ruz, LRz/(1 — o)? + Rd 

R, being the source resistance. Upon substitution of the single-pole 

approximation to a, and making the further simplification that ao, the 


low-frequency, common-base, short-circuit gain, is equal to unity, the 
following equation results, 


Ver. =| (1 + 87a) | 
Vi Rell +2s7.+ (14+ R./Rz,)s*r0 
where 7, is the time constant associated with the single-pole approxima- 


tion to the frequency-dependent current gain (ra=1/wr). The 
dominant time constants here are a conjugate pair of poles at 


= 1 : VR./Rz, 
——————— + Ya ron . 
Le Ri/ Re, ees, 








Pi, P2 = 


The position of these poles, normalized to 7,, is shown in Fig. 6. 
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Fig. 6—Pole-zero loci for Rz, variations (normalized to 7a). 


Selecting a normal Butterworth response (Q = 0.7) gives 
1 2. VR./Rz, 
1+R/Re, 14+ #./Rz, 


This may be interpreted as an upper limit on Rz, which, of course, 
prohibits the achievement of a high de stability factor. Practically, 
the upper value of Kz, is restricted by considerations of providing 
high-stage gain while maintaining a corner frequency, due to the 
collector base capacitance, of not less than the corner frequency due to 
high-frequency fall-off of the current gain. The devices used exhibited 
an fr of 4 GHz and a C,.» of 0.25 pF. This output capacitance (plus 
other additive capacitance due to loading) restricts the value of 
Re, to 100 Q for a stage gain of 8 dB, with the concomitant effect 
that Rz, (including the dynamic internal emitter resistance) be ap- 
proximately 30 ohms. For efficient quiescent point definition, and 
hence de stability, the value of external emitter resistance should 
be of the order of 1000 ohms. These parameter values allow a com- 
plete description of all other parameter values used. The embedded 
amplifier was designed with a band transition frequency of approx- 
imately 3 kHz; this allows easily realizable synthesis. The actual 
circuit was fabricated using a hybrid technology consisting of tantalum 
nitride and gold thin films on alumina substrates with beam leaded 


or Rr, = Rs. 
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Fig. 7—Embedded amplifier in thin-film, beam lead technology. 


device chips. A photograph of this realization is shown in Fig. 7. The 
pulse response (see Fig. 8) exhibits a 200-ps rise time with a gain of 
8 to 10 dB; the frequency response (Fig. 9) shows a flat gain curve up to 
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Ba 
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Fig. 8—Embedded amplifier pulse response. Upper trace, input. Lower trace, 
output. (200 mV/cm, 100 ps/cm.) 
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2 GHz 
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Fig. 9—Eight-decade gain response of broadband embedded amplifier. 


2 GHz with the exception of some parasitic resonance-induced devia- 
tions just below 2 GHz. The conjugate pole resonance at 2 GHz is also 
apparent at the cutoff. This should easily be eliminated by choosing a 
suitable ratio of R;/Rz, to provide a subcritical Q factor. 

The effect of mismatch of the emitter and collector circuit time 
constants, tc and rz, is shown in Fig. 10. These waveforms illustrate 
the effects of gross mismatches where the relaxation times observed 
are commensurate with a 3-KHz transition frequency. 


IV. CONCLUSIONS 


A technique has been described for providing gain to de in hf 
amplifiers without compromising the hf circuit optimization. This 





Fig. 10—Mismatch of emitter and collector time constants. Upper, rz<rc. 
Lower, rz=rc. (5 us/cm.) 
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embedding technique can be applied to a number of types of gain 
stage. The common emitter stage, from which results have been 
obtained, exhibited a de-to-2.3-GHz bandwidth with a voltage insertion 
gain of approximately 2.6. The gain was restricted to this value to 
minimize the effects of collector-to-base junction capacitance. With 
other stages, such as the modified cascode, this restriction is not as 
severe, and much higher gains are to be expected. The high-to-low- 
frequency transition is easily realized, in contrast to other split-level 
amplifier configurations. 

The enhanced de stability available from the embedding technique 
favors a cascading of individual gain cells to obtain larger amplifier 
gains. It is worth noting that the freedom of choice of device operating 
point inherent in the embedding technique allows an optimization of 
biasing conditions for low-noise operation, appropriate to post-detec- 
tion amplification in optical PCM terminals. 
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Coupling Coefficients For Imperfect 
Asymmetric Slab Waveguides 


By D. MARCUSE 
(Manuscript received July 6, 1972) 


This paper presents a collection of formulas that are necessary for the 
treatment of radiation and mode conversion phenomena of imperfect 
asymmetric slab waveguides. The coupled mode theory of dielectric wave- 
guides is briefly reviewed, and general expressions for the coupling coeffi- 
cients are given. The field expression of the guided and the radiation TE 
and TM modes of the asymmetric slab waveguide are stated, and are used 
to derive formulas for the coupling coefficient for slight core boundary 
trregularities. 


I, INTRODUCTION 


Mode coupling phenomena and radiation losses caused by core- 
cladding interface irregularities have been studied extensively for 
symmetric slab waveguides and for round optical fibers.‘~’ These 
results are not immediately applicable to the asymmetric slab wave- 
guides used in integrated optics circuits. It is the purpose of this paper 
to collect the formulas for the normal modes of the asymmetric slab 
waveguides, and for the coupling coefficients between guided modes 
and guided and radiation modes caused by core boundary irregularities 
of these waveguides. 

The coupling coefficients between guided modes are useful for the 
design of distributed feedback sections for lasers and for an evaluation 
of unintentional mode coupling caused by core boundary roughness. 
The results collected in this paper are further necessary for the evalua- 
tion of radiation losses caused by core boundary roughness. 

Because of the many parameters that enter into the theory, it is 
impossible to evaluate the formulas in graphical form for all cases of 
practical interest. This paper is thus a collection of the required 
formulas which the reader can use to evaluate his particular problems. 
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II. SUMMARY OF THE COUPLED MODE THEORY 


The coupling theory is based on an expansion of the solution of 
Maxwell’s equations in terms of normal modes. The general theory of 
the mode expansion and the derivation of the coupling coefficients have 
been published by A. W. Snyder.?? His theory is based on local normal 
modes. Local normal modes resemble the modes of the ideal asymmetric 
slab waveguide with perfect core boundary. However, the boundary 
of the perfect guide is allowed to change in such a way that it coincides 
with the actual deformed core boundary at the particular point z along 
the waveguide axis at which the coupling coefficients are to be eval- 
uated. Since the waveguide width parameter is no longer a constant, 
the local normal modes are not themselves solutions of Maxwell’s 
equations. They must be superimposed with z-dependent expansion 
coefficients to form such a solution. The fact that these modes form a 
complete orthogonal set and coincide with the modes of a fictitious 
waveguide, the width of which is locally (at the point z under consider- 
ation) the same as that of the deformed waveguide, explains the name 
“local normal modes.” It is also possible to express the general field 
in terms of the modes of the ideal waveguide, the constant width of 
which differs slightly from that of the actual waveguide. This expansion 
suffers from convergence difficulties that are caused by the fact that 
the normal components of the electric field are discontinuous at the 
core boundary. The modes of the ideal guide are discontinuous at the 
dielectric interface of the ideal guide which does not coincide with that 
of the actual guide. The expansion in terms of ideal modes of the 
waveguide is thus discontinuous term by term at a point where the 
entire series must be continuous, and furthermore, it must describe a 
discontinuous field at the interface of the actual waveguide at a point 
where each individual term of the expansion remains continuous. The 
expansion in terms of local normal modes, on the other hand, describes 
the field discontinuity with a series, the individual terms of which 
are discontinuous in just the right way at the point of discontinuity of 
the entire series. The convergence behavior of this latter expansion 
can thus be expected to be superior to the expansion in terms of ideal 
modes. 

The electric and magnetic fields of the imperfect asymmetric slab 
waveguide are expressed by the series expansions 


4) 4) 


EBSYG 2 +e &) (1) 


). (2) 


(+) ) 


) jo 
H=)> (¢ ay +c, &#, 
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The expansion coefficients c‘*? and cS are functions of the length 
coordinate z. The superscripts (+) and (—) indicate waves traveling 
in positive and negative z-direction. The sums in (1) and (2) are 
symbolic representations of a summation over guided modes plus an 
integration over the radiation modes of the continuum.*® In order to 
simplify the notation, both sum and integral are indicated by the 
same symbol. In the integral, the summation index » is replaced by 
the continuous variable v, and the sum must be understood as the 
integral 


- / dv. (3) 
y 0 
The local normal modes £, and H, are solutions of the equations 
ip, (e, x 5e,*”) + Vi x roa a Iwegn? eo” (4) 
Fis,(e, X & )+V:X & = — taped. (5) 


The upper and lower signs and superscripts belong together. The 
symbols appearing in these equations have the following meaning. 


8, = propagation constant of mode » 

e, = unit vector in z-direction 

V,; = transverse part of the operator V 

w = radian frequency 

€o9 = dielectric permittivity of the vacuum 

Mo = magnetic susceptibility of the vacuum 

n = dielectric constant of the waveguide [n = n(za, y, 2) ]. 


Substitution of the field expansions (1) and (2) into Maxwell’s equa- 
tions and use of the orthogonality relations [see (9) ] lead to the set 
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Fig. 1—Sketch of the asymmetric slab waveguide with distorted core boundaries. 


66 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1973 


of coupled wave equations?:** 


(+) 


de, 


= = Bee +E Kee” + Keo) (6) 
te 

) (-, mes 
es = Bye, — 2 (Ky "Go Ky ‘ey os (7) 


The coupling coefficients have the form?:* 


(+) 
(4.0) | 0&, ‘a 
K y Ss + x Hig *@z 
: =i. ( 02 e 
(+) 
* OK, 
~p( 8” X : ) see} ae. (8) 
Z 


The asterisk indicates complex conjugation. The superscript p stands 
for (+) or (—) while the factor p assumes the values +1 and —1. 
P is a normalization parameter which is related to the power carried 
by the modes via the relation 








ae) +)" 
- (& XH, )-edx = Pé,,. (9) 
2 Jw 
The symbol 6,,, indicates the Dirac delta function if both » and »’ 
represent continuous variables, it represents the Kronecker delta 
symbol if both » and v’ are discrete labels, and it is zero if one subscript 
belongs to discrete modes while the other indicates a mode of the 
continuum. 
The coupling coefficients (8) are not very easy to evaluate since they 
are expressed in terms of derivatives of the mode functions. A. W. 
Snyder’ has shown that the coupling coefficients can be transformed 


to the following more useful form. (8,-— = —£,“) 
Wwe 2 On? * 
a = : seg de (10) 


— 4P(B® — B®) J_, dz 
BL y 


The coupled wave equations (6) and (7), with the coupling coefficients 
(10), provide an exact description of imperfect dielectric waveguides. 
The one-dimensional integral in (10) constitutes a specialization to a 
two-dimensional problem in view of our interest in the asymmetric 
slab waveguide. By extending the integration over the cross-sectional 
x, y plane, the general coupling coefficients are obtained. 
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For our purpose, it is advantageous to derive approximate coupling 
coefficients for asymmetric slab waveguides with discontinuous index 
distributions. We use the fact that the normal component F,, of the 
electric field obeys the relation 


mE 7 naB ere. (11) 


It is shown in Fig. 1 that ni and ne are the values of the refractive 
index at either side of the interface. In order to derive the desired 
expression for the coupling coefficient, we assume that the discon- 
tinuous index distribution is smoothed out (in an arbitrary way) into 
a continuous distribution. We assume that the wavelength of the 
radiation is very much larger than the region over which the refractive 
index varies continuously and write 
2 


ny 
Ey = Ey. (12) 
n? 


We show in the appendix how the integral in (10) can be evaluated and 
obtain the result 


2 
(4.7) WEo 2 2 df fT ni (4)* () (4)* (p) 
pp my nee oe es | (n — N2)— | Se Eve + Euy Evy 
4P(B — Bi?) dz ns 
B 
dht ny 
(+)* | (p) 2 2 M1 (4)* (() 
+ Guz Eve | mat (m1 ae N3)— Fe va 
a=f (2) dz 


n? 
3 


+ Ey Sy + 83? | (13) 
z=—d+h (z) 


The index distribution can now again be considered as discontinuous. 
The functions f(z) and h(z) describe the deformation of the upper and 
lower side of the core boundary (see Fig. 1). The field components are 
taken inside the core region at the core boundary. The refractive index 
of the core is n; while nz and n; are the indices above and below the 
core region. The electric field components are related in the following 
way. 
(-) +) 


Eyz — Eyx 
6 = BL a 
6 eae 


The approximation involved in the coupling coefficient (13) consists 
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in using the x component H,, and z component £,, of the local normal 
modes instead of their normal and tangential components with 
respect to the interface. The approximation is valid provided that 
df dh 
— €l and —— <I, (15) 
dz dz 
For many practical applications it is sufficient to use an approximate 
solution of the coupled wave eqs. (6) and (7). In particular, for the 
calculation of the as loss coefficient, we use the approximate 
solution of (6) (u = 


o® (y= sitet att 


-exp| i [ore- a5" do Ju. (16) 


The coefficient c is the amplitude of the guided mode, the losses of 
which we want to calculate, taken at z = 0. The propagation constant 
G, is a function of z since it belongs to a guided mode in a non-uniform 
waveguide, 8, is independent of 2 since it belongs to a radiation mode. 
The relative power loss AP,/P, that mode » suffers in traveling from 
z= 0toz = Lis given in Refs. 5 and 6. 





AP, 1 aah |» 4p) 
iam [ Bale (17) 
y 0 


The sum in front of the integral sign indicates that we must add up the 
contributions of forward and backward traveling modes as well as 
the contributions from the various kinds of radiation modes that will 
be discussed in the next section. The integral extends over the range 
of p values that belongs to propagating (non-evanescent) radiation 
modes. We show below that the functional form of the radiation 
modes is not the same over the entire integration range. 


III. MODES OF THE ASYMMETRIC SLAB WAVEGUIDE 


We consider only the special case in which there is no field variation 
and no waveguide distortion in y direction. This fact is symbolically 
expressed by the equation 


a 
sieht): (18) 
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With the restriction (18), the fields of the slab waveguide can be 
classified as either TE or TM fields. The TE fields have only the 
following non-vanishing field components 


Ey, He, Hy. (19) 
The TM fields have the non-vanishing field components 
H,, E:, E:z. (20) 


It is assumed that the refractive indices of the waveguide are ordered 
in the following way 
ny > No = N3. (21) 


IV. GUIDED TE MODES 


The x and z components of the magnetic field follow from the EL, 
component by differentiation 








1 OH, 
H, = —- — (22) 
WHO 0z 
H, = — ; (23) 
Who OX 


The guided TE modes of the asymmetric slab waveguide are obtained 
as follows (the factor exp[7(wt — 6z)_] is always suppressed) : 


6, = A,e~ for 0f2%<0 (24) 
Y . 

&y = A,( cos Kx — —sin cr) for —-dsxs0 (25) 
K 

éy = A,( cos Ka ++ us sin cd eno for —-x<xs-d. (26) 
K 


The parameters appearing in these field expressions are defined by the 
equations: 


7 = B — nak? (27) 
g? = 6? — n3k? (28) 
2 = mk? — 6B? (29) 


k? = wequo. (30) 
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The propagation constant 8 is determined from the eigenvalue equation 


_ Ky + 8) 
Kk? — 70 


tan xd (31) 


The normalization of the mode is obtained by expressing the amplitude 
coefficient A, in terms of the power P carried by the mode. 


ep 
Seg eee (32) 


i(a +4 7G +7) 


The mode labels », that were used in the coupled wave equations and 
the field expansions, are suppressed. The modes are labeled according 
to the solutions of the eigenvalue equation (31). 


V. TE RADIATION MODES 


The propagation constants of the radiation modes do not have a 
discrete set of values. However, the asymmetric slab waveguide has 
different types of radiation modes. In the range 


nk SBS nok (33) 


we find only one type of radiation mode, the fields of which decay 
exponentially into the region three with refractive index n3, but are 
standing waves in the space above the waveguide with refractive 
index nz. We can visualize the physical mechanism for exciting these 
modes by assuming that a source at infinity sends a plane wave that 
impinges on the core of the slab waveguide under an angle that is 
given by 


cosa = ae (34) 
nok 


The incident plane wave penetrates into the core region but is totally 
internally reflected if the angle a stays in the range given by (83). 
This total internal reflection occurs because we assumed that n3 < ne. 
In the space above the core we find a reflected wave added to the 
incident wave supplied by the external source. This explains the occur- 
rence of standing waves in this region. It is not possible to find solutions 
of Maxwell’s equations satisfying the boundary conditions which have 
only traveling waves outside of the core region. 

In the range of propagation constants given by (33) we find the 
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following expression for the field of the radiation modes 


&, = A, cos px + “B, snpx for OS2%<m (35) 
p 

&,=A,cosox+B,sinox for -dSxs0 (36) 

&, = (A, cosod — B,sinad)e"4@t+) for -—x< 2S —d. (87) 


H, and H, are obtained from EF, with the help of (22) and (23). The 
constant B is related to the constant A by the equation 


A — to tan od 


B, = ———_—-A, (38) 
A tan od + to 


and the parameters appearing in these equations are defined as follows 


een ee (39) 
PS ak Se (40) 
A? = ngk? — 2. (41) 


Note that A is imaginary for 6 values in the range given by (838). 
It is convenient to identify the parameter p with the mode label v 
to label the radiation modes. We thus use the identity 


51 = 8(p — p’) (42) 
in (9) and find for the amplitude coefficient A the relation 


A 2 
#(o cos od + — sin cd) 


4 





2 4wyoP 


A 
7|8| 


SSS a (48) 
A A z 

re cos od + —sin od) + a(o sin od — — cos cd) 
t a 


With @ in the range (33) we find that p is confined to the region 


0S pS (nz — na)th. (44) 


Next we proceed to list the radiation modes that belong to propagation 
constants in the range 
OS 6BSnzxk 8B real 


and ons ; (45) 
0s |B| < £8 imaginary 


72 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1973 


The corresponding range for p is given by 


Crean A area e (46) 


For real values of 8, these modes propagate along the z axis while they 
are evanescent waves in z direction for imaginary values of 8. It is 
again possible to visualize the modes in the range (46) as being excited 
by a source outside of the waveguide core located at infinity. This 
source sends a plane wave toward the slab whose angle of propagation 
with respect to the z axis is given by (34). However, there is now no 
longer total internal reflection at the lower boundary of the core so 
that we obtain an incident and reflected (in x-direction) wave in the 
space above as well as inside the core. Below the core there is a trans- 
mitted propagating wave. However, we may now assume with equal 
justification that a second source sends a plane wave in the direction 
of the core from below. If both sources are turned on simultaneously, 
we obtain standing waves (in z-direction) below as well as above the 
waveguide core. The exact form of the radiation field depends on the 
relative phases between the two sources. It is thus not surprising that 
there should be an infinite number of ways in which orthogonal sets 
of radiation modes can be constructed. 

We list only the EZ, components of the modes and obtain the H, and 
H, components by differentiation from (22) and (28). (¢ = 1, 2) 


&y = C4( cos pt + "F. sin ne) 0OsS24<aH (47) 
p 


&, = C,(cosox + F;sinox) -dSxs0 (48) 


by 


c,| (cos od — F; sin cd) cos A(x + d) 


+ —(sin od + F;coscd) sin ate +0)| —-oe<xrs-—d. (49) 


The parameters c, p and A are given by (39), (40) and (41), A is now 
a real constant. Whereas the amplitude coefficient B, in (85) through 
(87) was related to A, by the boundary conditions, we now face the 
situation where the amplitude coefficient F; remains arbitrary. Equa- 
tions (47) through (49) satisfy Maxwells’ equations as well as the 
boundary conditions without any further restriction having to be 
imposed on the coefficient F’;. This freedom of choice is related to the 
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arbitrary amplitude and phase of the two plane wave sources of the 
radiation mode mentioned above. We must choose F; in such a way 
that two radiation modes with the same value of the propagation 
constant, but different values of F; become mutually orthogonal. But 
even this requirement does not specify the possible values of F; 
uniquely. We are thus free to choose F' values according to our own 
convenience. Of the infinitely many possibilities, we choose the F, 
coefficients such that in the limit nz = n3, even and odd radiation modes 
result. In the asymmetric slab waveguide no even or odd modes exist. 
But the guided modes become either even or odd as n2 approaches 
n3. We obtain the same symmetries for the radiation modes by a 
suitable choice of the F; coefficients. 


1 A 
Fy. = ————————_ 7 (a? — A?) cos 20d + —(c? — p’ 
bi (a? — A?) eal pens 22 se p*) 


= | ( — A’)? + 94 (62 — A?)(a? — p*) cos 2ad 
p " 
+S@— or] }. 60) 


The + sign (— sign) belongs to the odd (even) mode in the limit 
N2, = N3, A = p. We identify the mode label v again with the transverse 
propagation constant p, and obtain from the normalization condition 
(9) and (42) the relation between the amplitude coefficient C', and the 
power P 





A 
| —(cos od — F; sin od)? 
p 
fxg o —1 
+ —(sin od + F; cos ocd)? + 1+ “re| . (51) 
Ap p* 


We have thus obtained two distinct sets of radiation modes, the 
propagation constants of which lie in the range (45). The two sets are 
distinguished by the labelsz = 1 and7z = 2. F, and F: follow from (50) 
if we use both signs of the square root expression. It is noteworthy that 
the following relation holds. 


This listing contains the complete set of TE guided and radiation 
modes consistent with the restriction (18). All modes are mutually 
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orthogonal to each other. We have concentrated on the forward 
traveling modes. The backward traveling modes are obtained simply 
by changing the sign of 6, (@@ = —B), 


VI. GUIDED TM MODES 


The TM modes are very similar to the TE modes except that the 
roles of the field components are interchanged. We now list only the 
H, component and obtain the H, and EL, components of the field by 
differentiation 


i OH, | 
E, = (53) 














Nweo 0z 
—¢ dH, 
E, = (54) 
N*weo OX 
Ky, = Dev? 0OS2r<0 (55) 
2 
mY, 
Ry = D,( cos ke — —-—sin cc) —dsxs0 (56) 
n? Kk 
2 
2 
MAY 
Ry = D,{ cos Kd + =a Sul nd ener —o< xs —-d. (57) 
n? x 


2 
The parameters x, y and @ are defined by (27) through (29). The 
eigenvalue equation is 
2 ( 2 ae 20) 
nik(n n 
tan kd = inl Ae (58) 


nin? — niy8 
2 3 1 


and the amplitude coefficient is given by 











D: = 4weoP 
|B | 
nation? 
1N2 
ee 
4 4 nine ey? ning K? + 6 
(nox? i nev) d + Te a oe 
Y mie + ny? ] nie + nie? 
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VII. TM RADIATION MODES 


The TM radiation modes must again be split up into two ranges. 
In the range 0 S p S (n3 — n3)k, the fields have the form 


n 
5K, = D, cos px +—-G, sin px O0OS24%<a (60) 
n* p 
1 
ay = D,cos ox + G,; sin ox —-dsx<0 (61) 
KH, = (D, cos od — G, sin ad)e—*4 +4) —-exsxes-d. (62) 


The boundary conditions require that G, is related to D, in the following 
way: 
2 oh 
nA cos od — in3o sin od 


= (63) 
na sin od + nie cos ad 


The parameters o, p and A are defined by (39) through (41), —7A is 
a real positive quantity. The amplitude coefficient D, is related to 
the power coefficient P 


2 AnweP 1 
= ee ee en) 
Ne o? G, 
ieee 


In the range (n2 — n2)*k S p S © the radiation modes have the form 
(7 = 1, 2) 
2 
ne o , 
Ry = S,( cos px + a —-R;sin re) OS4%<a (65) 
n? p 


1 


a, = S-(cosozx + R;sinor) —dsxs0 (66) 
Ry = S,| (cos od — R;sin cd) cos A(x + ad) 
2 
n3 oo, , ‘ 
+ = ecas od + R; cos ad) sin A(x + d) 
n 


1 


—~ex< xs —-d. (67) 
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The coefficients R; are again arbitrary. Our choice 
1 


7 (nto? — n‘A?) sin 2od 


Ri. | nie — nA?) cos 2cd 


r 
nz A 4 4 4 4 m3 A? 4 4 
+ ——(neo? — nip?) + | (n30? — ny A?)? + — —(ne0? — nyp?)? 
ne n! p? 
2 


2 


ns A 4 4 4 4 y 
+ a —(n30? = nA?) (neo? = np?) cos 2ad (68) 
n? p 


2 


causes the modes with index 7 = 1 to be orthogonal to the modes with 
index 71 = 2 and, in addition, assures that these modes become even 
and odd in the limit nz = n3. [The + sign (— sign) belongs to the 
odd (even) mode. ] The normalization is given by 


2 4weoP 1 ns a 2 ns o? : 
S, = — + ——R; + ——(sin od + R; cos od)? 
r|B| n* ns p? ni pA 





1A i‘ 
+ ——(cosod — R;sinod)?} . (69) 
n? p 


3 


All amplitude coefficients for the TE and TM modes were taken to 
be real quantities. This assumption does not lead to a loss of generality 
since the necessary phase factors are incorporated in the expansion 
coefficients c,. 


VIN. COUPLING COEFFICIENTS 


With the help of the expressions for the normal modes and the 
coupling coefficients (10), any problem of the asymmetric slab wave- 
guide with arbitrary irregularities of its refractive index distribution 
can be solved, provided that the restriction (18) is imposed. Problems 
caused by core boundary irregularities or by gentle tapers can be solved 
with the help of the coupling coefficients (13). For convenience, a few 
coupling coefficients will be worked out explicitly. 

As long as the restriction (18) applies, TE modes do not couple to 
TM modes. All coupling coefficients between TE and TM modes 
vanish. We restrict the discussion to listing the coupling coefficients 
between guided TE modes, guided TM modes, and to coupling from a 
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guided TE or TM mode to its respective radiation modes for the case 
of core boundary irregularities. 

The coupling coefficients between two guided TE modes can be 
obtained from (18) and (25). (p = + or —) 


[= sin x,d sin x,d =| 
xem Lda [sim esi | as 
- (4) (p) 1 1 1 1\73 
(6, —B | 1848.1(a Das +—)(a oo -)| 
Yu Ou Yr 6, 


The eigenvalue equation (81) was used to express (70) in this simple 
form. This coupling coefficient (and all others to be listed below) holds 
for the special case f(z) XK a/x, h(z)<K a/x with x of (29). Instead of 
using the values of the field at x = f(z) and x = —d + h(a), the field 
values at x = 0 and x = —d were used. In order to see the agreement of 
this coupling coefficient with the coupling coefficient for the symmetric 
case [eq. (7) of Ref. 7], it is necessary to note that the core thickness 
d of this paper corresponds to 2d of Ref. 7. In addition, we need to 
keep in mind that only the Fourier components of f(z) and h(z) 
with spatial frequency ‘hag — B® contribute to coupling between 
modes yw and ». The derivatives appearing in (70) are thus equivalent 
to the products 7(@(* — pg) f(z) and i(e{* — p)h(z). Keeping 
these remarks in mind, complete agreement of (70) with (7) of Ref. 7 
is obtained for the special case ne = nz, y = 8. 

The coupling coefficient for coupling between a guided TE mode »v 
and a TE radiation mode p follows similarly from (13), (25), and (36) 
for radiation in the range 0 S p S (n3 — n3)tk 


2 = A . 
(mn — ni) ono 0 cos od + —sin od) 
1 


(B+) — B?) 


df  sinx,d € = =) é dh 
i | sin x,d| ne — ne A * 


2 «cos od + — sin od 
a 


1 I A 2 
. {18.61 (« +—-+ GG cos od + — sin od) 
Yr 


v. t 


A a7) —3 
+ oi(o sin od — — cos od) | e(1) 


a 
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The coupling coefficient of the guided TE mode » to the TE radiation 
mode p in the range (nz — n3)!k S p < © is given by 


2 2 
(+1?) (n1— Ne) *kk, 
Kor map 


i 1\3 
(a — 8] w18,6.1(a+—+—) 
Z : yw 4% 
2 


df sinx,d (eas 


dz |sin x,d| 





3 dh 
) (cos od —F; sin od) — 
n?—n? dz 

1 2 
A o ao » |? 2) 
| (cos od —F; sin od)?+—(sin od+F;; cos od)414+F | 
p pA p” 
The factors Ff; and F2 are obtained from (50). The radiation modes do 
not all propagate along the z-axis. Propagating modes are confined to 
the range 0 S p S nok. 

The reader should not be startled by the fact that the coupling 
coefficients (70) have the dimension m=! while the coupling coefficients 
(71) and (72) have the dimension m-?. The different dimensions are 
attributable to the fact that the coupling coefficients between guided 
modes occur under a summation sign while the coupling coefficients 
that describe coupling to radiation modes occur under an integral sign. 
The integration is performed with respect to p, the dimension of which 
is m~!. The product c,K ,,dp has the dimension m=! in agreement with 
the dimension of the coupling coefficients for guided modes.* 

Finally, we list the coupling coefficients for the TM modes. Coupling 
between guided TM modes is described by the coupling coefficient 
(p = + or —) 


2 2 
Seat (ni — Ne) DoD gu | 2 (4) (p) 2 df 
K < a paige oe stance), AOnra r N1YVy —= 
H 4AP(g 25 B‘?))weonin' (nBy B + NY Yu) °F 

u y 








‘ é Z 2 4 2 4 2 4 2 4 2 
sin x,d sin k,d ni — ng be + nyyv) (Ney + ma | 


|sin «,d sin xyd| n? — n? L (ntx? + 46?) (mtx? -+ 146?) 
1 2 3» lp 3 lp 


dh 
(nsBy* By” + 118,8,) | . (73) 


The coupling coefficients for the TM modes are far more complicated 


t Note that /|c,|?d, is dimensionless so that the dimension of c, is m}. 
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than the corresponding coefficients for the TE modes. To simplify the 
notation, we did not substitute expression (59) for the mode amplitude 
into (73). 

The coefficient for coupling from a guided TM mode to a TM 
radiation mode in the range 0 < p < (nz — n3)3k is 





2 2 

(4.2) (ni _ n2)D Dro {Ost =~) df 
v =. Ter y — 6 Somer sae 
: 4P(B® — B)weon?n? |W" Way ae 








: 2 2 42 42 
sin K,d Ni — Ng] Neky + NY» 4 (+) (p) Grp . 
————|1186, 8, (cosod — sin od 
nn -+- ne D 
v v 


[sin «,d| ne — ne rp 


+ 66 (si fe! i) || (74) 
76, { sina — cosa —}- 
Dy» dz 
The amplitude coefficients D,,, D-, and G,, are obtained from (59), 
(63) and (64). 

The coefficient for coupling from a guided TM mode to a TM 
radiation mode in the range (n} — n2)*k S p S » is given by 


2 2 
nin D ie d 
or ae ( 1 2) g p {12° a ov,R3) Os 
4P(B™ — B® )wegnin: dz 





4 2 2 42 4 2 
sink,d ny — n3[noky + niyy PT ca) cp) 
aS ——\—]||8, 8B, (cosod — R;sincd) 
|sin x,d| n* — n. nie? +. ni? 
Vv v 


dh 
+ o6,(sin cd + R; cos od) | =| - (75) 
2 


The amplitude coefficients D,,, S;,, and R; are obtained from (59), 
(68), and (69). The index 7 assumes the values 1 and 2, and corresponds 
to the two types of radiation modes that are distinguished by the + 
and — signs in (68). The superscripts, -+- and —, attached to the 
coupling coefficients are supposed to indicate whether the modes 
travel in + or — 2-direction. 

It can be shown that the coupling coefficients derived in this paper 
specialize to the correct expressions*:® of the symmetric slab waveguide 
in the limit nz = n3, y = A, p = 0.7 


t Equations (9.5-26) and (9.5-27) of Ref. 5 must be divided by 73, eq. (9.5-31) 
must, correspondingly, be divided by nj. 


80 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1973 


IX. CONCLUSIONS 


We have collected the formulas for the modes of the asymmetric 
slab waveguide and have used this information to derive the coupling 
coefficients between guided modes as well as between guided and 
radiation modes for the case of very slight core boundary imperfections. 
Also presented is the general theory of coupled modes of dielectric 
waveguides and the general formulas for the coupling coefficients. 
The theory collected in this paper is useful for the description of mode 
conversion and radiation phenomena. Phenomena such as the grating 
coupler and the interaction of acoustic waves and guided light waves 
can readily be treated with the theory presented here. For an applica- 
tion to statistical irregularities of the core boundary, the reader is 
advised to consult Refs. 5, 6, and 7. 


APPENDIX 
Evaluation of the Integral (12) 


We consider the index distribution of the slab waveguide as being 
smoothed out to avoid the discontinuity at the core boundary. It is 
assumed that the index varies only in a direction perpendicular to the 
core boundary. We use a coordinate system wz’, 2’, the axes of which are 
perpendicular, and parallel to the tangent at a particular point of the 
core boundary as shown in Fig. 2. In this coordinate system, we assume 
that the refractive index is of the form 


n? = F(z’). (76) 


For values of z’ S$ 0 we have F = n?; for values x’ > — we have 
F = nj. At the end of our discussion, we let & — 0. The scalar product 






> ~ TANGENT OF 
CORE BOUNDARY 


Fig. 2—Sketch of the coordinate systems used for the evaluation of integral (10). 
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of the electric field vectors can be expressed as 
E.'S) = Cura + Ey Ent (77) 


The coordinate ¢ indicates a direction parallel to the core boundary. 
&,, is continuous at the core boundary and can be taken out of the 
integral. &,., on the other hand, is discontinuous. We express it in 
terms of the field just inside of the core and, using (12), write 


fin 


yz! = Eyz! z'=0- 78 
Fan ee) (78) 


The scalar product can thus be written in the form 


re 
* 


&,° & = F(2’) ) sane -00e Eyx! Jee 0 + 6 Bik: (79) 
We thus have to deal with two different integrals. We first consider 
the integral 


dx. (80) 








© OF (x’) ax’ f° OF (x’) 
| dz = —— 
0 02 dzJo oa" 
We obtain from Fig. 2 the relations 








Ox’ ; 

— =sina (81) 

0z 
and 

da’ 
dx = : (82) 
COS a 
The integral can thus be evaluated 
OF (x’ 
[ ~ oe = [F(é) — F(0) ] tana. (83) 
0 


At the upper core coundary, we have tana = df(z)/dz, and at the 
lower core boundary we have tana = dh(z)/dz. Taking both core 
boundaries into account, we find with the help of (76), 


on? df 
ie re Ey 0% ea (ni _ ma) = (i801) 20s 
—wo O02 


+ (na _ ni) S = (BiuBrlennash: (84) 
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The integral associated with the normal field components is essentially 











of the form 
i 1 oF(z’) ae tone) [= 1 — aa 
0 F(x’) dz 
F(é) — a 
= (tan a) FOF® © ; (85) 


The integral containing the normal field components results in 


on? 2 ne df 
in nn Ea Eyed rs (nj ay Ne) as ~ (Enz Eve" \enp — as 
= Z 


0z n: 
2 


2 2 Ni dh 
+ (ny = N3) 2 (Gps Or 2) eeaek das (86) 


3 


In (13) we replaced x’ with x and ¢ with z. These approximations are 
valid provided that the inequalities (15) apply. The error is of second 
order in the derivatives of f(z) or h(z). 
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Batch Input to a Multiserver Queue with 
Constant Service Times 


By A. KUCZURA 
(Manuscript received July 26, 1972) 


Delay probability formulas for batch input to a finite number of con- 
stant-holding-time servers are derived under the assumption of statistical 
equilibrium. The service-delay distribution (delay until a first request from 
the batch enters service) 1s given in terms of the roots of a transcendental 
equation, while the probability of no service-delay and the average delay are 
expressed directly in terms of the number of servers, the holding time, and 
the parameters of the input process. A numerical example with a fixed 
batch size 1s discussed. 


I. INTRODUCTION 


Batch arrivals constitute an important class of input processes in 
the theory of queues. The investigation of the problem of batch input 
to a group of constant-holding-time servers was motivated by the 
existence of installations with multiple Automatic Calling Units 
(ACU). Customer-based computer equipment controlling the ACU’s 
is capable of originating simultaneous requests. The dial-tone markers, 
the first common control equipment in a No. 5 Crossbar central office 
to serve the requests, can be modeled as a group of constant-holding- 
time servers. 

Another example comes from an information transmission system. 
Messages containing a (small) random number of characters (a batch 
of characters) arrive according to a Poisson process and must be 
transmitted to some destination. Delayed messages are stored in a 
buffer. Since the transmission time per character is usually fixed, this 
system provides another example of the model studied. 

In Section II, the mathematical model used in this study is de- 
scribed and the input process defined; the state equations are written 
and used to derive the generating function for the equilibrium state 
probabilities. The probability of no service-delay is found in Section 
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III, while the average delay is computed in Section IV. The service- 
delay distribution is given in terms of the roots of a transcendental 
equation in Section V. A numerical example with a fixed batch size is 
discussed in Section VI. The effect of this batching scheme on the 
average delay and the service-delay probability is examined. 


II. MATHEMATICAL MODEL 


The model studied here is that of a queuing system with a finite 
number of servers, batch arrivals, and constant holding time. The 
assumptions are 


(t) Requests arrive according to a compound Poisson process, that 
is, requests arrive in groups or batches and the instants at 
which the batches arrive constitute a Poisson process. 

(22) There are c servers and each request has access to any one of 
them. 

(zit) All requests have the same constant service time, r. 

(iv) The delayed batches wait until service becomes available and 
are served in order of arrival. The service discipline for requests 
within a batch is arbitrary, i.e., not specified here. 

(v) The system is in statistical equilibrium. 


Systems with simple Poisson input have been studied as early as 
1920, when A. K. Erlang! obtained expressions for the probability of 
delay for arbitrary values of c and the average delay fore = 1, 2, and 3. 
The first complete treatment of such systems was by Pollaczek.?’ 
Crommelin,*> using a method which is simpler than that of Erlang 
or Pollaczek, also derived general formulas for the probability of delay, 
the average delay, and the delay distribution. A simplified and concise 
account of Crommelin’s work is given by A. Descloux,® who also shows 
how Pollaczek’s formulas can be deduced from Crommelin’s results. 
The development herein is an extension of Crommelin’s results to the 
case of compound Poisson input using the simpler methods employed 
by Descloux. 

We now define the input process. Consider events which happen in 
groups rather than singly, that is, requests arriving in batches at a 
group of c servers. For k = 1, 2, ---, let Ni(t) be a Poisson process 
with intensity \; which governs the arrival of k-sized batches. Assume 
independence of the processes N;(t), k = 1, 2, ---. Let N(é) be the 
total number of requests that have arrived in the interval (0, ¢]. 
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Then 
N(t) = © ENO (1) 


k=1 
is called a compound Poisson process (Ref. 7, page 271). 
The probability that an arriving batch is of size k is equal to A;/), 
where 


From eq. (1), we see that the mean and variance of the number 
of arrivals per unit time are 


Mi= Dk and = pe = DY K*Ag, (2) 


k=1 k=1 


respectively. The generating function of the probability distribution 
mi(t) = P{N(t) = n}, n = 0, 1, 2, ---, is given by 


m(t,z) = Do ma(tje” = e8@), 


n=0 


where 


B(z) = : Ane” — A, 


n=1 
and hence the probabilities 7, (t) are given by 


(Art) (Nat) *8+ + Ant) ** 
n() = ery, n= 01,2, 06, 3 
ii o Sail o 


where J, is the class of all sets of nonnegative integers {ki, ko, ---, kn} 
such that k, + 2k2 + --- + nk, = n. 

The expression given in (3) is not suited for computing 7, (¢). These 
probabilities are more conveniently computed from the recurrence 
relation 


t k 
Tit) = hed Dh G4 1)Ac-p i), k =0,1,2,---:, 
as (4) 


Tot) ee 
Equation (4) is easily obtained from the relation k!7;,(t) = +“ (t, 0), 
where the superscript denotes differentiation with respect to z. 


Special cases of the compound Poisson process are obtained by 
choosing different convergent sequences of the positive constants 
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A1, Ae, +++. One such sequence is obtained by setting A; = a/r*, 
r>1,j7= 1, 2, ---. This special example has become known as the 
‘stuttering’ Poisson process.’ In this case, a simple expression for 
mwn(t) can be obtained by noting that the generating function has a 
power series expansion in 2, the coefficients of which are given in terms 
of the Laguerre polynomials L», that is, 


z\ 2 zZ\" 
v1 em (1=2) F acon (2), 
T/7 n=0 r 


> OZ . 
B(z) = ea a 


since 


It follows that 
mwo(t) = e~*# 


—At 
Trlhs= — [L,(—otr) — Ly-1(—atr) ], nm=1,2,---. (5) 


We will now obtain the equilibrium state equations, and find the 
probability generating function of the stationary distribution for the 
general case. Let X(t) be the number of requests (waiting or in service) 
in the system at time ¢. Let 


pis(t) = P{X(t) = j|X(0) = 4} 


be the transition probability functions of the process {X(é), t 2 O}. 
It is clear that X(t) is not Markovian. If, however, we examine 


X, = X(kr), k=0, 1, 2, ae | (6) 
we see that this sequence is Markovian and, in fact, {X,,k=0, 1,2, ---} 
is a homogeneous Markov chain with one-step transition probabilities 

Dis = P{Xigi = j|Xe = 1}, = 0, 1,2, --:, 
given by 
1j(7), for OS17tSc 
Dij = 4 7j-s4c(7), for c<tSgjre 
0, for jgte<t. 


We will be interested in the distribution of the number of requests 
in the system encountered by an arbitrarily arriving batch (a batch 
arriving at a time point a long way from the origin, i.e., after statistical 
equilibrium has been reached). But since the instants at which the 
batches arrive constitute a Poisson process, this distribution is the 
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same as the stationary distribution 


Pi = lim pis(t), g oe 0, ty 2, Petry 


t->00 
of the process { X(t), ¢ = 0}. Moreover, if this limit exists, then so does 


lim P{X, = j|Xo = 7} 

k 00 
and they are equal. Consequently, the distribution of interest to us is 
given by the stationary distribution of the imbedded Markov chain 
(6). This distribution is obtained by solving the Chapman-Kolmogorov 
equations 


Po = T0(7)ae 


nt (7) 
Pn = Tn(T)Qe + ys Dien maelT); n= 1, 2, aces 
m=c+1 
where 
an = bs Pm 
m=0 


We will assume that wiz < c so that the stationary distribution {p;} 
exists. 

We need to solve the system (7) for the unknowns pn. To do this we 
introduce the probability generating function 


f@) = L pra” 
n=0 
Multiplying both sides of (7) by 2" and summing over n, we have 


f(2) = ae7F) YD. pergruas(r) 2” 


n=1 j=1 


oe a 
= a,ere e)-p >} >) Deri n—j(T)z” 


j=1 n=j 


1 
ge 
where 


g(z) = do pn”: 
n=0 


Thus the probability generating function of the sequence pn, 
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n = 0,1, ---, is given by 


g(2) — 2A 


f(z) = ; (8) 


— 2°e—78 (z) 


III. PROBABILITY OF NO SERVICE-DELAY 


We say that the service of a batch is delayed if upon its arrival all 
servers are busy. Hence, the probability of no service-delay will be 
defined by a-_;, that is, the probability of at most c — 1 servers busy. 
An explicit expression for this probability will now be found. 

We start with eq. (8). Since the p,’s are probabilities, f(z) is holo- 
morphic in |z| S$ 1 and, therefore, the zeros of the denominator and 
numerator in |z| S$ 1 must be the same and have the same multi- 
plicities. We will show that the denominator of (8) has c distinct 
roots in |z| S$ 1 and that all of them, with the exception of z = 1, lie 
inside the unit circle. 

For |z| = 1+6, with 6 sufficiently small and positive, we have 


wo 
[ev] < eat exp T | Ds ralel*| = e7 #1540 (52) 
k=1 


where p1 is defined by (2). 
Since tui < c by assumption, we have 


e7H15+0 (8?) < (1 -+ 5)° — lz|¢, 
and by Rouché’s theorem, the equation 


e7Bhz) —7= 0 


has exactly c roots within the region |z| = 1 + 6. Let these roots be 
denoted by 21, 22, «++, 2-1, 2(= 1). Then 
(z) 1, 21, 2, +++, Z—1 are distinct. 


(iz) |zn| <1 forn = 1,2,---,c— 1. 
To prove (2), first note that the root 2 = 1 is simple because 
1 — 2%e—7Als) 
lim ——————- = m1 —c ¥ 0- 
zl a— 1 
Similarly, for any root 2;,7 = 1, 2, ---,c — 1, 
1 ca 2ce— 78 (2) Cc— e 
lim. ———————- = 2; "e-78(2i) k > kee = e|. 
k=1 


ZZ 2 Ca 


BATCH INPUT TO MULTISERVER QUEUE 89 


The first two factors cannot vanish for any admissible choice of the 
root 2;, so that if 2; is to be a root of second or higher order, we must 
have 


(A121 + 2o2e + Brges +--+) =. 
But this is not possible since 
T{Asé1 + QAvz3 + 3Ag23 + ---| S tur <c, 
and the roots 1, 21, 22, «++, 2—1 are therefore all distinct. 
To prove (zz), suppose that |z,| = lforsomen,n = 1,2,---,¢—1, 


then |exp [78(zn)]| = 1 and hence the real part of 78(zn) must be 
zero, that is, RLB(zn) ] = 0. Hence we must have 


RB (zn) ] = B | = he (1 = :1)| — 0. 


Since all terms in the sum are nonnegative, we have ®(1 — 2*) = 0 
for all k, and therefore z, = 1, contrary to the assumption. It follows 
that |z,.| < 1lforn = 1, 2, ---,ce—1. 

Since the numerator of (8) is a polynomial of degree c, f(z) has 


eter Con CEE Com ener ey 
2— 1)(@ — 21) (2 — 22)- ++ (@ — 2-1 
f(@) =A [= zce—78(z) (9) 
The condition f(1) = 1 determines A: 
aces (10) 


Fe a a re ee 
(en) ea) SL seen) 


In computing a,_; it is convenient to introduce the generating function 


F(z) = D> ane": 
n=0 





Then, since @n — Qn-1 = Pn, Nn = 1, 2, ---, we have 
(1 — z2)F(z) = fe), 
or 
ipods. 
La 2 


Now, making use of (9), we obtain 


(2 — 21) (2 — 22)+++ (2% — 2-1) 


F(z) = —A aa 


90 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1973 


The probability of no service-delay, a._1, is given by the coefficient 
of 2°"! in the expansion of F(z): 


C= M1T 
(1 — z)(1 — Z2)° “9 (1 = 2Zc-1) 


An expression for a,_1 which does not involve the roots 2; is obtained 
through an application of the generalized argument-principle (Ref. 9, 
page 151). That is, suppose y (z) is holomorphic and ¢(z) is meromorphic 
on and inside the contour C’. Let a,, k = 1, 2, --+, be the zeros with 
multiplicities rz, and 6,, k = 1, 2, -+-, the poles with multiplicities s; 
of the function ¢(z) inside C. Then the generalized argument-principle 
states that 


(11) 


QAe-1 = —A = 





1 o (2) 
— | le) dz = Di rw(an) — Lo siv(Bx)- 
2rt Jo (2) k k 
Taking the logarithm of eq. (11), we have 

c—l 

log a.1 = log (¢ — wit) — DY log (1 — 2)- 

inn 
We will eliminate the roots 2; from the second term of the right-hand 
side of the preceding equation. Let 


¢ (2) = e@7Blz) — 2°, 


and note that ¢(z) has simple zeros at 2 = 21, 2, +++, 2-1. Choose 
y(z) as the principal branch of log (1 — z). The generalized argument- 
principle yields 


c—1 


1 
> log (ld — zn) = — | log (1 — z)d[log ¢(z)] = J 


where C’ is the contour |z| = 1 — e and e(>0) is chosen so that 
Zn, n = 1, 2, ---, c — 1, lie inside C but z,(=1) is exterior to C. We 


will now show that 


eo 1 ioe] 
J = log (¢ — muy) + & x  1;(nz), 
n=l j=ne 


and hence 
foo) 1 ioe) 
loga.1 = — LD - Le a;j(nr)- (12) 


n=l j=nec 


d 
Note first that the principal branch of log (1 — z) a [2 log (1 — z)] 
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is holomorphic in |z| S$ 1 — e. Since its integral on C is zero, we have 
1 — 2 cetB(z) 
J = re ; log (1 — z)d [tog —==— |. 
Integration by parts yields 


1 Leger tetPeI) | dz 
13 fo SE 
271 Jc Deng aaa | 





The integrand above has a simple pole at z = 1, and its residue there is 
equal to log (c — wir). Choose 6(>0) such that the only zeros of 
#(z) in the disk |z| < 1+ 6 are 1, 21, 2, ---, 2-1, and let C; be the 
contour |z| = 1 + 6. Noting that the integral of log (1 — 2“)/(z — 1) 
on C, vanishes, we have 





1 
J = log (¢ — pir) — =| log [1 — z-°e78(2)] 
2rt 


Cy z-—l1 


Now since |z~%e78)| < 1 on C,, the power series for log [1 — z2~¢e7#@) ] 
converges uniformly on Ci, and termwise integration is allowed, so that 





1 eo 1 g ne 
J = log (¢ — m7) +$— DY -f at ae dz 
2Tt n= N =i 


= log (¢ — wit) + r ~[1-> —— > i Crane verde | 


n=i 1 201 j=0 


Expanding the integrand in powers of z, and integrating term by term, 
we see that the integral is zero for all terms except one, and that there 
it is equal to 277 times the coefficient of 2~!. But the coefficient of 2} 
is exactly 7n-_;-1(n7), so that 


eo 1 ne—l 
J = log (¢ — wir) + u -[1- 2 7 ee unr) | 


tien 
= loge anit) ss (nr) , 
n=1 1 jane 
and this is the result stated earlier. 
For the case of a simple Poisson input, we have \; = A, A; = 0, 
j = 2,3, ---, and (12) reduces to [Crommelin, Ref. 5, eq. (5) ] 


(Ant)? 


oo 1 ao 
loga.1= — ) - ») —e”" 
net jee 4) 
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Note that this expression and eq. (12) differ only in the terms 7;(nr) 
and ((Anr)4/7!)e—***, which represent the same probabilities in two 
different systems: both are equal to the probability of exactly j 
arrivals in the time interval nr. As we shall see later, these probabilities 
appear again in the expressions for the average delay and the service- 
delay distribution. 

For “stuttering” Poisson input, eq. (12) reduces to 


e—lonrr] (r-1)) oe) 1 
————— > a [L;(—onrr) — Lj-1(—onrr) | 


june r 


logai1=—- > 
n=1 
where the L,(£) are Laguerre polynomials. 


IV. AVERAGE DELAY 


Under equilibrium conditions, the average delay D is equal to the 
average amount of waiting per unit of time divided by the average 
number of arrivals per unit of time. The average amount of waiting 
per unit of time is equal to 


L (n—C)pn, 
n=c+1 
so that D is given by 
- bo 
D=— Y (n-c)/p, =— LL npr —t 
H1 n=c+1 M1 n=0 


where 1 is the average number of arrivals per unit of time defined by 
(2). An explicit expression for D can be readily obtained by noting that 


id d 
ry np = in| S90 |, ljej<i1, 
n=0 zal dz 


where f(z) is the generating function given by (8). Straightforward 
differentiation of (8) and application of L’Hospital’s rule lead to 





D | ae ee | pot + mwit(uit — 1) — c(e — 1) 
—=—> + —- (13) 
T M17 iail — 2: 2uit(¢ — pit) 

where 21, 22, +, 2-1 are the roots defined previously. Again we wish 


to eliminate these roots. The method used in the previous section 
suggests the application of the generalized argument-principle with 
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(1 — z)— as ¥(z), and ¢(z) as before. Thus we have 
a ae | 1 1 ¢'(z) 








pe eee, De cl—2z ?(z) 





dz = K. (14) 


d 
Noting that the integral of (1 — "se {log [z1(1 — z)]} on C is 


equal to (c — 1)2z7 (the residue at the simple pole z = 0 isc — 1), eq. 
(14) becomes 


e 1) 4 1 | 1 afi i ea | 
es eee Lo hee 


Integration by parts yields 


K=( 1) 1 i ; - _ | dz (15) 
= is Loe Lae 





The integrand in (15) has a pole of second order at z = 1, and its 
residue there is found to be 


c(e — 1) — ryi(ru1 — 1) — The 


2(c¢ — ry) eeu) 
Consequently, 
K=- c(c — 1) — ryr(tu1 — 1) — The 
2(c¢ — ry) 
1 re is ak dz 
ales | 


Combining (14) and (16) and noting that the integral of 
(1 — 2) log (1 — 274) 
vanishes on Ci, we obtain 
D 1 
—-=- ul log[1 — 2-*e76()] 
Cy 


T 2ri1 


dz 
(1 — z)? 
Recall now that |z~°e76)| <1 on Ci, and hence log [1 — 2-¢e78@)] 


has a uniformly convergent power series representation in 2~°e7® on 
Ci, so that 





a ae (17) 


D eee pea 
: 


= Pie. dz 
bit 272 n=l NJ Cy, (1 = z)? 
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The integrand in (17) has poles at z = 0 and z = 1 with residues 
ne—1l 
> (ne — k)ai(n7) 
k=0 
and 
(wait == c), 


respectively, giving us the final result 


D tds 
— = — DO - D kaneyn (nr). (18) 


T B1iT n=1 7 k=1 


V. SERVICE-DELAY DISTRIBUTION 


For the purpose of obtaining the service-delay distribution (delay 
until a first request from the batch enters service), let us define g(t) 
as the probability that among the requests present at some time to, 
at most m of them are still in the system at time f) + ¢. Considering 
the state corresponding to gmc4c_1(t), we see that, at most, mc + ¢ — 1 
of the requests preceding the given batch will be in progress at time ¢ 
later, and consequently, at most, c — 1 of them at time m7 + ¢ later. 
This is the condition for the service-delay d to be less than m7 + t, 
or in symbols 


P{d < mr + t} = Oniexe-att), 0 St<r. (19) 
To determine g,,(t), we introduce the generating function 
G(z,t) = Le gm(é)e”. 
m=0 


Upon noting that 57%, -0 ta—-m(t)gm(t) = Gn, we have 


eo 


eB) SF gu(them Sma (te" 


m=0 n=0 


e—tB() > 2” Ds Tn—m(t)gm(t) 
n=0 =0 


G(z,t) 


I 


ioe] 
e7 tB(z) yy Anz” — eB) F(z), 


n=0 
Substituting for F(z) we obtain 
(2 — 21) (2 — 2e)- ++ (2% — 2e_s)e~ 8) 


(os a (20) 
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A direct expansion of the right-hand side of (20) in powers of z is 
not desirable because the coefficients of z involve sums the terms of 
which have alternate signs and, therefore, are not well suited for 
computation. To circumvent this difficulty, we first obtain a Laurent 
series expansion of G(z,t) in the annulus 1 < |z|] < |&|, where & is 
that root of 2° exp [—78(z) ] — 1 = 0 which has the smallest modulus 
exceeding 1. The existence of such a root can be proved as follows. 
Since x° exp [—7B(x)] takes on the value 1 at x = 1 (x = real part 
of z), has a positive derivative there, and vanishes at infinity, the 
equation 2° exp [—78(z) ] — 1 = 0 has at least one root outside the 
unit circle. 

For1 < |z| < |é|, the absolute value of z~¢ exp [78(z) ] is less than 
unity. Expanding the denominator in powers of 2~¢ exp [7@(z) ] and 
the exponential function in powers of z, and collecting like-power terms, 
we obtain, for 1 < |z| < |é|, 


oo c~1 ie) 
G(zjt) = Dy Lo an LD mespere—nl (J + Lr — tb] 2* (21) 
k=0 n=0 j=0 


where q, is the coefficient of 2" in the polynomial 
A(z — 21) (2 — @2)++-(@ — c_4). 


Since z = 1 is the only singularity of G(z, t) in |z| < |&| (asimple pole 
with residue —1), G(z, t) + (2 — 1) is holomorphic in |z| < |£] and 
hence for |z| < |&| 


1 00 
G(z,t) + Pea = expansion (21) + > 2”. 
Zz — 


n=1 
Therefore, for |z| < 1, we must have 
G(z,t) = >> 2" + expansion (21) + 3d) 2. 
n=0 n=l 


From this equation, we obtain the service-delay distribution 


c—1 oe) 
P{d <mr+é} =1— D an X memest2ye—n—aiL(j + 17 — t], 
n=0 j=0 


m=0,1,2,---, OSt<7r. (22) 


VI. A NUMERICAL EXAMPLE 


We examine a fixed-size batching scheme which provides some 
insight into the effect of batch arrivals on the average delay and the 
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probability of service-delay. Suppose customers arrive in batches of 
size m. Then A; = 0 for 7 4m, wi = Mm and 


pnrnr ent 
a; (t) = k! 


0 ; otherwise. 





; for jz = km 


Figure 1 shows the average delay experienced for an arbitrary customer 
as a function of the occupancy p = ry:/c, for various values of m. 
We assume that the holding time is unity, and that c = 4. 

Equation (18) written in the form 


ne—l 


1 (oe) 
D=— 5 ~ {nga —e tees) +E (no — jrs(n) 


Mi n=17 


was used to obtain the curves drawn in Fig. 1. We might point out 
here that the above series converges slowly when the occupancy is near 
unity. In the interest of speedy computation it may be necessary to 
solve for the roots of the denominator in eq. (8) and then use (13) 
to calculate the average delay. The same remarks apply to eq. (12) 
which is used to compute the probability of no delay. 

Because holding times are constant, several interesting phenomena 
are observed. First, if the batch size is an integer multiple of the number 
of servers, say m = kc, then the mean time until the service of an 
arriving batch (or the first customer from the batch) begins is the 
same as the average delay in a one-server system with single Poisson 
arrivals and holding time k. From the Pollaczek-Khintchine formula, 
this number is given by 


kp 
2(1 — p) 
Hence the mean delay which an arbitrary customer experiences is the 
average of the above number and the mean delay experienced by the 
last customer in the batch to be served. Thus we have 
k-1 pk 


Dice 
2 + oa») 


(m = ke). 
Note that as p > 0, D — (k — 1)/2. 

On the other hand, if the number of servers is an integer multiple 
of the batch size, say c = jm, then the system may be viewed as a 
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Fig. 1—Average delay in an M/D/4 queue when arrivals occur in batches of size m. 


collection of single-server systems with constant holding time and 
j-phased Erlangian input of mean offered load ui/j. This can be seen 
by imagining that the sets of m servers required to serve the arriving 
batches are chosen in cyclic order. 

Figure 2 shows the probability of service-delay (the probability that 
the service of an arriving batch is delayed) as a function of the oc- 
cupancy, for various values of the batch size m and c = 4. From eq. 
(12) we obtained and used the following expression for the service- 
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PROBABILITY THAT THE SERVICE OF AN ARRIVING BATCH IS DELAYED 





0.01 0.02 0.04 0.06 0.08 0.1 0.2 0.4 0.6 0.81.0 
p, OCCUPANCY 


Fig. 2—Probability of service-delay in an M/D/4 queue when arrivals occur in 
batches of size m. 


delay probability : 
oe) 1 ne—l 
1 — exp { - 2 -|1 - > n(n) |} 


n=1 j=0 


Phenomena similar to those observed in Fig. 1 exist here also. For 
example, if the batch size is an integer multiple of c, say m = kc, then 
the probability of service-delay is the same as the probability of delay 
in a one-server system with single Poisson arrivals, 1.e., it is simply p. 

On the other hand, if c = jm, then the service-delay probability is 
the same as would be found in a system with single Poissonian arrivals 
of intensity \ and j constant-holding-time servers. 
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Crosstalk in Uniformly Coupled Lossy 


Transmission Lines 


By J. C. ISAACS, JR., and N. A. STRAKHOV 
(Manuscript received July 5, 1972) 


The crosstalk between two identical, uniformly coupled, lossy transmis- 
ston lines ts examined. Equations are derived which can be solved to obtain 
formulas for the near-end crosstalk (NEXT) and far-end crosstalk 
(FEXT). An example ts worked which illustrates the mutual influence of 
the two lines in terms of the modal voltages and currents. The mutual in- 
fluence of the two lines is also studied by comparing the results of this 
example with the ‘‘classical’”’ crosstalk formulas which assume weak 
coupling and neglect the influence of the disturbed line on the disturbing 
line. It is shown that the influence of the disturbed line on the disturbing 
line can be neglected for NEXT for most weak coupling situations. For 
sufficiently high frequencies and/or long line lengths, however, this in- 
fluence cannot be neglected for FEXT. 


I, INTRODUCTION 


One of the earliest analyses of crosstalk in coupled transmission lines 
was made by Campbell ;! later Shelkunoff and Odarenko? used a similar 
method to analyze the crosstalk in coaxial structures. These ‘‘classical’’ 
formulas were derived for two parallel transmission lines with weak 
coupling and matched terminations. One drawback of these analyses 
is that they do not take into account the effect of the disturbed line 
on the disturbing line. However, their crosstalk formulas are simple in 
form and easy to analyze. Also, they are applicable to any parallel, 
uniformly coupled transmission lines with weak coupling. 

Somewhat later an analysis of coupled transmission lines was made 
by Rice.*? His results apply under quite general conditions and are 
expressed in compact matrix notation. However, his results have 
apparently not influenced current analyses, possibly because the 
formulas are more complicated to analyze than those in Refs. 1 and 2. 
Coupling between two pairs under similarly general conditions is 
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given by Kuznetsov and Stratonovich.* Although the emphasis is in 
obtaining results for the time domain, the basic approach is similar to 
the one we will follow. The specific time domain results do not apply 
to the transmission lines of interest here because the frequency de- 
pendence of the primary constants is not taken into account. More 
recent analyses®:® have relaxed the assumptions of weak coupling and 
matched lines and do take into account the effect of the disturbed line 
on the disturbing line. Unfortunately, these analyses focus attention 
on the lossless case in order to obtain crosstalk formulas which can be 
readily calculated. While the lossless case may be of interest for some 
line lengths and frequency ranges, it does not cover many applications 
which are of great practical interest. 

In this paper, a fairly general analytical model is presented for two 
identical, parallel, uniformly coupled transmission lines with a common 
ground return. This model does not assume weak coupling, matched 
terminations, or lossless lines. The resultant crosstalk equations, al- 
though somewhat unwieldy, can be evaluated with the aid of a 
computer. 

The motivation for this study was, in part, to assist in the analysis 
of special cables being utilized in the interconnection of equipment 
_racks. These cables, referred to as flat flexible cables, have conductors 
that are not twisted and therefore can couple to each other strongly 
under certain conditions. The results of this study are also of interest 
to those studying longitudinal mode coupling effects in multipair 
cable. 


II. DERIVATION OF CROSSTALK BETWEEN TRANSMISSION LINES WITH 
ARBITRARY CONSTANT COUPLING 


The starting point for this analysis is the set of coupled differential 
equations which are assumed to govern the two transmission lines. 
They are 


diy : i 
— = —(R+ jol)Iy — joLlle (la) 
dx 
dl, . : 
— = —(G4+ joC)Fi — jo Ee (1b) 
dx 
dks ‘ ‘ 
— = —(R + jol\la = jolili (Ie) 
dx 
dl, ; : 
= —(G+ jwC)E, — joC Ey (1d) 


dx 
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where 


E; is the voltage across transmission line 1, 7 = 1, 2 


I; is the current flowing in transmission line z, 7 = 1, 2 
R 

L | are standard distributed resistance, inductance, 

G | conductance, and capacitance, respectively 

C é 

w is frequency in radians/s 

L 


-| are “coupling” inductance and capacitance; the relationship to 
C..| physical quantities will be derived in a later section. 


A number of assumptions are tacitly implied in order for the equa- 
tions to describe the physical situation. These will now be discussed. 

The first and most basic assumption is that only two sets of voltages 
and currents are involved in the coupling mechanism. This assumption 
is readily met in the case of unbalanced transmission lines shown in 
Fig. la. However, for balanced transmission lines, depicted in Fig. 1b, 
other voltages and currents may play a role in the coupling mechanism. 
They will only be negligible if each transmission line is well balanced 
with respect to ground. 

Another important assumption is that the power propagating down 
the transmission lines is essentially described by TEM modes. This 
assumption is required to assure that the telegrapher’s equations 
[ie., (1) with L, = C. = 0] are valid. 

For the time being, it is not necessary to specify whether or not R, 
L, G, C, C., and ZL. are frequency independent. However, if these 
results are translated from the frequency domain to the time domain, 
the frequency dependence of these parameters will have to be specified. 








a 
GROUND-~ 
PLANE 


GROUND~~ 
PLANE 


Fig. 1—(a) Unbalanced transmission lines. (b) Balanced transmission lines. 
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Of course, it is a fundamental assumption of this analysis that the six 
parameters are independent of z. 

Differentiation of (la) and (1c) with respect to x and substitution 
of (1b) and (1d) for the appropriate quantities result in 








ay 
= Ankh, + Ak, (2a) 

dx? 

aH, 
= Ap; + Anke (2b) 

dx? 

where 

An = (R+ joL)(G + joC) — LC, (3) 
Age = Au (4) 
Ay = (R+ joL)joCe + (G+ joC) joL. (5) 


Assuming a solution of the form EH; = A,e?* and Ey, = Ae? for (2) 
yields 
y=+y7t or +y~ 


where 
yt= VAG Ags 
= {LR + jo(L+ L)ILG + jo(C + C.)}}} (6) 
y = VAun — A 
= {LR + jo(L — L.)ILG + jo(C — C.)]}} (7) 
and 


Ai if y=uatyt 
A» — 3 
—A, if y= ty. 
Therefore, the general solutions for £i(x) and £,(x) are expressed as 
E,\(z) = Atev* + A-ev* + Bte-1"** + Bre“? (8a) 
E,(z) = Atev** — A-ev* + Bte-1** — Bre-v* (8b) 


where the four constants A+, A~, B+, and B- will be determined from 
boundary conditions. The corresponding expressions for the two 
currents can be obtained by solving (la) and (1c). After the required 
algebraic manipulations, one obtains 


1 1 1 1 
I = — —Aterts — —A-er? 4+ —Bte-v* + —B-e-vr* (9a) 
(2) Zt Z Zt Z me 


1 1 1 1 
= Sa +z ——A-pr7z —Bte-ytz — ——B-pe-17-z 
I2(x) = (Ate? fe EY Re ~B e77 -B-e~7 (9b) 
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Zy T,(o) I, (2) 
+ + + 
Vy E,(o) ey ae E,(2) Z> 
In(o) I, (2) 
———_ 
+ + 
23 Ep(o) soe E,(2) 24 








Fig. 2—Boundary conditions imposed on coupled transmission lines. 


where : 
q+ = [| (10) 
G+ jw(C + C,) 
= a aa aes =. (11) 
G+ jo(C = C.) 


The boundary conditions that will be imposed are shown in Fig. 2. 
The corresponding boundary condition equations are: 


Vi = Zili(0) + £1(0) (12a) 
O = Z3I2(0) + E2(0) (12b) 
O = Z.1,(l) — Fi) (12¢) 
O = Z4I,(l) — E2()). (12d) 


Substituting (8) and (9) into (12) results in four equations for the 
four unknowns At, A-, Bt, and B-. Solving for these quantities and 
substituting them into (8) yield a solution* of the form 








Ey (2) 

y, 7 ght (x) + 2R-(z) (18a) 
E2(2) 

7 = $Rt+(x2) — $R-(2). (13b) 


The near-end crosstalk is given by £.(0)/E.i(0) whilet the far-end 
crosstalk (equal level) is given by E2(l)/Ei(l). 


*In principle, (13) could be derived from eqs. (1.25) and (1.30) of Ref. 3. 
However, applying the boundary conditions (12) to these equations leads to sufficient 
algebraic complication that it is easier to derive (13) directly. 

t The conventional definition of near-end crosstalk is E2(0)/V1 which is equivalent 
to the above definition (except for a factor of 2) under the conditions of loose coupling 
and matched terminations. 
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Obtaining expressions for R+(z) and R~(x) involves very extensive 
algebra for arbitrary impedances and in general does not lend insight 
into the coupling process. For applications where switching circuits 
are involved, several special cases of interest partially simplify the 
algebra in obtaining expressions for the near-end and far-end crosstalk. 
Some of these cases are: 


(t) Z1 = Z; and Z, = Z,,’ (possible application to analog switch- 
ing systems). 
(2) Z1 = Z3 = Z, Z, = Zs = @& (possible application to switching 
systems using ‘‘totem pole”’ logic). 
(1) Z, = 0, Z, = Z3; = Zs, = & (possible application to switching 
systems using simple transistor logic). 


These cases all involve somewhat bulky expressions, but they can be 
obtained with perseverance. 

The case that will be studied in detail in the following section is 
Z1 = Zp = Z3 = Z4. This case is of special interest for three reasons: 


(2) The coupling capacitance and inductance can be related easily 
to physically measurable quantities. 
(72) The conditions under which the ‘‘classical’”’ crosstalk formulas 
apply can be studied. 
(221) This case is of interest for many applications involving analog 
circuits. 


III, RELATIONSHIP TO PHYSICAL QUANTITIES 


The behavior of the coupling process is most easily illustrated by 
modifying the excitation assumed in (12). Instead of only exciting 
circuit 1, an excitation will also be applied to circuit 2 as shown in 
Fig. 3. The set of equations, (12), is modified by letting Z; = Z, = Zs; 
= Z, and replacing (12b) by 


pVi = Zil2(0) + E2(o) (14) 
where p is a complex scalar. Obviously, the case p = 0 corresponds to 


the situation in Fig. 2 with equal terminating impedances. With this 
substitution, (13) becomes 














Ex(z7) l+tp_4 l—p __ 
v, a R(x) + ae R, (x) (15a) 
= = Ri (a) — 2 Bee) (15b) 
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Z, Tjlo) T, (2) 
—_> 
+ i + 
i E; (0) Pits E, (2) Z; 
Z, T>(o) T,(2) 
+ + + 
pV; E2(0) coe E (2) Z, 








Fig. 3—An alternate means of exciting the coupled transmission lines. 


where win a 
4 Poet -® — Pye 
tO) = pat Pee oe 
00 10 
= Poe‘) = Pye 
0) pr grt = Pier od 
01 11 
and 
P a 17 
o =1+ Ge (17a) 
P = 17b 
wo =1—- Gt (17b) 
Pei (17¢) 
= ars c 
01 Z- 
ae ees (17d) 
11 — J- 


It is now apparent that any excitation of the two coupled trans- 
mission lines depicted in Fig. 3 will result in a response which will be a 
linear combination of the two functions R;*(x) and R; (x). Therefore, 
these functions will be referred to as modes. They will now be examined 
in somewhat greater detail. 

If p = 1, then (15) reduces to 


oe ss a = Ri(z). (18) 


Therefore, if the two lines are energized with equal and in-phase 
sinusoids, the resulting voltage distributions are given by R;*(x). Note 
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that R;*(x) contains terms with a + superscript and does not contain 
any terms with a — superscript. This, in turn, signifies that the 
propagation constant and the characteristic impedance associated with 
R;*(x) are given by (6) and (10), respectively. This result will now 
be interpreted in terms of the distributed capacitance and inductance 
associated with the two transmission lines. 

Figure 4 shows a cross section of the two coupled transmission lines, 
assuming symmetric excitation (9 = 1). According to (18), the voltages 
on the two lines are equal at every point x; this fact is indicated on 
Fig. 4. The capacitance per unit length of each conductor to the 
ground plane is denoted by C,, while the coupling between conductors 
is denoted by Cie. 

Since there is no potential difference across Cie, the signals propagat- 
ing along the two transmission lines are not affected by it. Therefore, 
each signal propagates along its respective transmission line as if the 
two lines were uncoupled and with distributed capacitance: 


C+C. = C;. (19) 


The distributed inductance can be expressed in terms of C, using the © 
relationship : 
(C + C.)(L + Le) = ne, (20) 


(See, for example, Chapter I, Sec. 4, eq. (31) of Ref 8.) This 
formula is applicable to the case where the frequency of excitation is 
sufficiently high that the magnetic field penetrating the metal con- 
ductors contributes a negligible amount to the coupling inductance. 
Thus 








Lites = (21) 
Co C. 
Turning now to the case p = —1, eq. (15) yields 
Ex(a) E(x) = 
re ead = Reo wv). 22 
v, v, (x) (22) 


The propagation constant and characteristic impedance associated 
with this mode are expressed by (7) and (11), respectively. As with the 
previous mode, this mode behaves as if the two lines were uncoupled 
but with primary constants R, G, C — C., and L — L,. To see how 
these are related to the physical capacitance, it is convenient to 
depict the voltages and capacitances as shown in Fig. 5. 

As indicated by (22), the voltages on each transmission line are 
equal but opposite in sign. A vertical line between the two conductors 
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Cio 
: + ; 
Coa Ey Cy E,=E, 
GROUND PLANE 


Fig. 4—Cross section of two coupled transmission lines—symmetric excitation. 


must therefore constitute a surface of ground potential. Thus, the 
total capacitance to ground influencing a signal propagating along 
either line is given by 

C— CC. = Cy + 2Cie. (23) 


As in the previous case, one may take (C — C,)(L — ZL.) = we if the 
frequency is sufficiently high. Thus 
pe 


L—- L, = ————-: (24) 
C; + 2C 12 
Combining (19) and (23) to solve for C and C, yields 
C=C,4+ Cr (25) 
and 
Cs — —Cp2, (26) 
while combining (21) and (24) to solve for L and L, results in 
C C 
= pene (27) 
Cy(C, + 2C 12) 
and 
C 
12 (28) 


= we —————— 
Cy(Cg + 2C12) 


One final observation is that, in the higher frequency bands of in- 


ee 
2012 | 2C42 
I + 
+ | : ec 
Cy >, Cy =x Eg = Ey 
= 
YA 
GROUND PLANE 


Fig. 5—Cross section of two coupled transmission lines—asymmetrical case. 
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terest, the following approximations can be made with little error: 





ee ee (29a) 
2 Zt 
R 1 _ 
y= 2 Ze + jovpe (29b) 
L+LA} 
aa (TF) ai 
[-& (- - =). (29d) 
C es Cx 


(See Chapter II, Sec. 18, eqs. (18), (8), and (6) of Ref. 8.) 
Substituting eqs. (25) to (28) into the above equations yields 








1 RC, a 
yt Se + jovpe (30a) 
2 Vue 
1R(C, + 2C met 
Bee a oe re (30b) 
2 vue 
vue 
Zt=> 30 
C, (30c) 
Vue 
-=—_—-_. 30d 
Cy t+ 2Cir2 


Thus, the R; (x) mode has a higher loss and smaller characteristic 
impedance than the R;* (x) mode. 

It is now possible to outline a measurement procedure that will yield 
all quantities required to evaluate (13). Since the effective dielectric 
constant surrounding most physical transmission lines is determined 
by the detailed geometry of the insulation and shields surrounding 
each conductor, the quantity Vue will be assumed unknown for the 
following procedure, even though » and e may be known for each 
constituent material in the transmission line. 


Step 1. Measure C, and Cio. 

Step 2. Terminate the coupled pairs in four equal impedances 21 
and energize the two lines from the same voltage generator. The 
generator frequency should be in the range for which the approxi- 
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mations leading to (29) and (30) are valid. In other words, terminate 
and excite the lines as shown in Fig. 3 with p = 1. 

Step 3. Adjust all four impedances, Z,, until Rj“ (1) is a maximum. 
It is easy to show that in this case Z; = Zt. Using (380c) and the value 
of C, from Step 1 gives Vue. 

Step 4. With Z; = Z+, measure R;' (1) which equals (4) exp (—y*l). 
With yt given by (30a), R can be solved for directly, given the length 
of the coupled lines, J. 

Step 5. The remaining quantities, y~ and Z~, can now be evaluated 
using (30b) and (30d). Note that there is an additional check on the 
value of Vue through the imaginary part of yt. 


IV. COMPARISON OF RESULTS WITH CLASSICAL CROSSTALK FORMULAS 


We now use the results of the previous section to analyze the 
“classical”’ crosstalk formulas as derived by Shelkunoff and Odarenko.? 
Their analysis assumed uniform weak coupling between two parallel 
transmission lines terminated in their characteristic impedances. 
Assuming the two transmission lines had identical primary and 
secondary constants, they derived the following formulas for near-end 
crosstalk (NEXT) and far-end crosstalk (FEXT): 


N(w) = =e Clee) (31) 
4Zoyo(w) 
F(w) = fas (32) 
ay 


where Z12, Zj. are the mutual impedances between the two lines for 
NEXT and FEXT, respectively, Zo and yo the secondary quantities 
of an isolated line [i.e., (10) and (6) with L, = C, = 0], and | the 
length of the lines. The above formulas were derived neglecting the 
effects of the disturbed line on the disturbing line. In the discussion 
that follows, we shall examine the validity of this assumption. 

Referring to (6), (7), (10), and (11), and assuming L, « L, C, « C, 
and Z; = Zo, it can be shown that 


Gee 7, A25 (33a) 
jw 2 
270 
—|<K] 33C 
Zi, (33¢) 
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and 
yt =yote (34a) 
ee, one 
€= ———— 
aie 
arate (34¢) 
Yo 








Now using (33), (34), (18), (16), and (17) one can show that 


6 
1 — e*7%!| cosh (2el) — 7 — sinh (2el }) 
oe | ea iis 
E;(0) =. Zo 16 
where the approximation is obtained by only assuming weak coupling. 
Now for sufficiently small | «l|, the term in brackets is approximately 
unity so that (85) becomes 





N(w) = 


6 
N(w) = Z, (Leen) 


qu 
4yoLo 
This agrees with the Shelkunoff and Odarenko result, (31), with 
Z12 = ja(Le —~ C325). 

For larger values of J, the exponential term in (35) can usually be 
neglected for lossy lines. In the lossless case, the term in brackets will 
cause a departure from the classical formula for sufficiently large | «| ; 
however, in weak coupling situations, the length and/or frequency 
required to invalidate the approximation cosh (2«l) = 1 are usually 
large enough to invalidate the lossless assumption. Thus, for most 
practical situations involving weak coupling, eq. (81) is adequate. 

We now consider far-end crosstalk. Again referring to (13), (16), 
and (17), letting x = 1, and assuming the conditions for weak coupling 
exist, it can be shown that 





(Le — C.Z0)(1 — e202), (36) 


yt evil 


é 
Fw) = ara (37) 


Substituting (84a) into (87) results in 
eel ‘cow ef! 
eel + et 


= —tanh (cl). (38) 


F(o) = 
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Referring to (34b), for sufficiently large w, Zo is a real constant and e 
is an imaginary number; thus, (38) can be written as 


F(w) = —7 tan (— jel) (39a) 


~ —-ed for | el | <2 
6 


(Le + ZC 
— Jw Se 


39b 
on, (39b) 


Therefore, for |e| < (1/6), eq. (39b) agrees with (32) with Zi. 
= —jw(L, + Z2C.). Shelkunoff and Odarenko? point out that (32) 
must not be carried to an absurd conclusion: namely, that most of 
the far-end power will reside in the disturbed circuit for sufficiently 
long transmission lines. They conjecture that, in the limiting case, the 
far-end power will divide equally between the two lines. Equation 
(39a) indicates that the far-end power oscillates back and forth between 
the two lines as a function of / (or frequency, since ¢ is a function of w). 
Equation (39a) is valid over a larger range of J than (32) (or 39b), 
although it is not valid for all 1, since it is based on an approximation, 
(34), which is multiplied by J. To be more specific, (6) and (7) can be 
written as 





v® = [vo — LC) + jeryo(ZoC + Le/Zo))}! 
jw 2 w? 2 
= vf ae (L, + C.Zo) we “1.0.| . (40) 
YoZo Yo 


Assuming the conditions for weak coupling (L.<«<L, C.<«C), the 
third term in the brackets is much smaller than the second term, and 
the second term in the brackets has a magnitude much less than unity, 
so (34) is a good first-order approximation to (40). Now for Zp a real 
constant, ¢€ is an imaginary quantity. However, for any given frequency 
the higher order terms from (40) contain real parts which will dominate 
the behavior of the exponential terms in (87) for sufficiently large 1. 
Thus, referring to (37), in the limit as 1—>~, 


1 
lim |F(w)| = lim 


lperni a 


oe ee 





> 1 +00 


The same result is reached by fixing / and letting w ©. 
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In summary, when weak coupling conditions exist, the effect of the 
disturbed line on the disturbing line can be neglected for most NEXT 
calculations; however, for a sufficiently large / and/or w, the effect of 
the disturbed line on the disturbing line cannot be neglected for 
FEXT calculations. This is because, for certain values of / and/or w, 
the far-end power in the disturbed line will be comparable to the far- 
end power in the disturbing line. 


Vv. CONCLUSION 


An analytical model for analyzing crosstalk between two identical, 
parallel, uniformly coupled transmission lines with ground return has 
been presented. Using this model, formulas were developed for the 
two sets of modal voltages and currents on the transmission lines. It 
was found that each mode has associated with it a propagation factor 
and characteristic impedance which, in general, are different for each 
mode. 

By applying different sets of excitation voltages to the two lines 
(changing boundary conditions), the effect of each line on the other 
can be analyzed in terms of the modal quantities. Using this technique, 
formulas were derived for the coupling capacitance and inductance in 
terms of the distributed capacitance and distributed inductance of an 
isolated line, the distributed capacitance to ground for the nonisolated 
lines, and the permeability and permittivity of the medium surround- 
ing the transmission lines. 

The mutual influence of the two lines was also studied by assuming 
weak coupling between them and then deriving NEXT and FEXT 
formulas using this model. These formulas were compared with the 
classical formulas which do not take into account the influence of the 
disturbed line on the disturbing line. In the case of NEXT, the effect 
of the disturbed line on the disturbing line was found to be negligible 
for most practical cases. In the case of equal level FEXT, however, the 
effect of the disturbed line on the disturbing line can be quite signifi- 
cant for sufficiently large line length and/or frequency. 
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Applications for Quantum Amplifiers in 
Simple Digital Optical Communication 
Systems 


By 8S. D. PERSONICK 
(Manuscript received August 14, 1972) 


Previously published results on the performance of optical direct de- 
tection digital receivers using avalanche detectors are extended to the case 
where incoherent noise due to quantum amplifiers in the transmission 
medium is present at the detector. These calculations are applied to de- 
termine the usefulness of quantum amplifiersin simple digital transmission 
systems where the optical source instability results in a required amplifier 
bandwidth which may be orders of magnitude greater than the modulation 
bandwidth. It 1s concluded that practical applications exist where quantum 
amplifiers can be used in analog repeaters between regenerating repeaters in 
a hybrid digital system; and also as front ends of regenerating repeaters to 
increase their sensitivities. 


I. INTRODUCTION 


Quantum amplifiers can be used in optical communication systems 
even if the optical sources are only partially coherent. They can serve 
as optical analog repeaters between regenerating repeaters in a 
hybrid digital system to compensate for transmission loss (see Fig. 1), 
and also as the front ends of regenerating repeaters which demodulate 
back to baseband. 

This paper investigates the applications for quantum amplifiers 
in simple digital communication systems employing “‘on-off’’ intensity 
modulation. It will be assumed that due to source instability, the 
optical system bandwidth may be orders of magnitude greater than 
the bandwidth of the modulation, and that the quantum amplifiers 
have limited gain. 

We shall calculate Chernov bounds on the required signal energy per 
pulse at the detector of a digital repeater (to be described in detail 
below) to achieve a 10~° error rate as a function of the received sponta- 
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Fig. 1—Fiber communication system. 


neous emission noise level from quantum amplifiers, or of the incoherent 
background noise level for some typical values of dark current of the 
detector, mean detector gain, and thermal noise introduced by circuitry 
following the detector. For these Chernov bound calculations, we shall 
assume a unilateral gain detector. Numerical results for other detectors 
and parameter values can be obtained by using the moment generating 
functions to be derived below with the results of two previous papers!” 
concerning Chernov bounds for direct detection intensity modulation 
systems using avalanche gain. 

We shall also derive some signal-to-noise ratio results which can be 
used to approximate the energy required per pulse to achieve a desired 
error rate. These signal-to-noise ratio results are consistent with 
previous published work of other authors.*-> 

Throughout this paper it will be assumed that the modulation 
consists of varying the intensity of the transmitted signal in each 
baud interval to produce pulses at the regenerating repeater which are 
of one of two amplitudes, and such that the pulses are approximately 
constant in a baud interval of length 7 seconds. Generalization to 
other pulse shapes should be straightforward using the results below. 


II. A MODEL FOR THE QUANTUM AMPLIFIER NOISE 


Throughout this paper we shall model the source as follows. Its 
voltage in a single spatial mode will be given by 


E(t) = v2re{ Aero} (1) 
where |w| < 27B/2. 

That is, the source will be nominally at optical frequency 2/27 but 
due to source instability there will be an uncertainty of bandwidth B. 
(The conclusions and numerical results that follow also hold if the 
source is a randomly phase-modulated sinusoid having a bandwidth B 
of the form F(t) = V2re{Ae*@()+24 } |) | 

If the modulated signal is to be transmitted over a channel with 
quantum amplifiers and possibly with optical filters as well, then these 
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devices must have a bandwidth of at least B + 1/T to accommodate 
the modulated signal for all possible values of w. (The term 1/7 is due 
to the increase in bandwidth of the source due to the modulation.) 

At the regenerating repeater input, the classical field will be modeled 
as follows (assuming only a single spatial mode) 


E,(t) = V2re{m(é)e*r)? + n(t)} (2) 


where m(t), the modulation, assumes one of the two possible pulse 
amplitudes (a pulse which is approximately constant in each baud 
interval 7’) and n(t) is a complex Gaussian random process which 
represents the incoherent spontaneous emission noise introduced in 
the quantum amplifiers or represents incoherent background noise.°® 
In each baud interval 7, we can expand the field complex envelope in 
a Fourier series;' taking only enough terms to include the “‘system”’ 
bandwidth B’.* 


( L—~1) /2 et(2rkt/T) 
ct) = mew +n) = n{ —— | (3) 
sayy T 


where L (the number of temporal modes) is given by L = B’T 21 
+ BT. Defining 


1 
‘ VT is interval ( ) , 
= | (dyerterberrr dy (4a) 
n,=-= nlbje*™ a 
: VT baud interval : 


we have for each value of k, a, = mz + nx. Because we have a digital 
system, the signal components, m;z, take on one of two values for each 
k. The noise components n; are complex Gaussian random variables. 


(nun;) = Ndx;, (nen;) = 0 (4b) 


where J, is the classical incoherent noise spectral height’: (n(t)n*(7)) 
= N,6(t — r), and (x) stands for the expected value of zx. 


t A more rigorous and general approach taken in the Appendix is to expand the re- 
ceived field in a Karhunen-Loéve expansion using the autocorrelation function of the 
noise n(¢) at the detector input as the kernel. The approach taken here is justified on 
grounds of simplicity and intuitiveness. 

t The system bandwidth B’ is the minimum of the quantum amplifier bandwidth, 
the detector optical bandwidth, and the bandwidths of any filters in the optical path 
preceding the detector. Of course B’ > B + (1/T), if we are to accommodate all 
possible signals with the unstable source described above. 

§ That is, the number of watts of incoherent power falling on the detector in the 
bandwidth B’ is N,B’. 
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At the regenerating repeater it will be assumed that the field falls 
upon a detector with internal gain (e.g., an avalanche detector) and 
causes the detector to emit ‘‘primary” hole-electron pairs at rate 
(average pairs/second) 


0 2 
A(t) = ra e(t) | (5) 


where # = Planck’s constant /27, 2 = optical frequency in radians/s, 
n = detector quantum efficiency, and e,(¢) was defined in eq. (3) above. 

Due to internal gain, each primary ‘‘count”’ (hole-electron pair) 
produces a random number of additional secondary counts. Because 
the modulation pulse is approximately constant throughout a baud 
interval, we will be interested in the total number of counts produced 
by the detector due to signa] and incoherent noise in each baud interval. 
The moment generating function’ of the random total number of 
counts, N, produced in each baud interval, 7’, is defined as 


ee) 


My(s) = L e*p(n) (6) 


nrn=0 


where 
p(n) = probability that N = n. 


From previous work! we have 


My(s) = Mc(¥c(s)) (7) 
where 


We(s) is In[Me(s) ]. 


M <(s) is the moment generating function of the random internal gain 
G and M¢(s) is the moment generating function of the total number of 
primary counts, C. 

We can evaluate M¢(s) as follows. Define the quantity A as 


n n (L—1) /2 
A=— e,(t) ("dé = —- > ax,|? (8) 
AQ J baud interval | ( | AQ _ (1r—1) /2 | 4 


where A is the average number of received primary counts in a baud 
interval given e,(t). A is a random variable, since the {a,} are random 
variables having the following joint complex Gaussian probability 
density 

Nt 





L 
p tas, da: --az} =, II e—lan — mk|?/No, 


k=1 tN, 


The probability distribution of the total number of primary counts 
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Fig. 2—Twin-channel system. 


C in a baud interval given A is Poisson, i.e., 


Cpo—A 





p(c|A) = = probability that C = ¢ given A. 


c! 


It follows that M¢(s) is given by 


Me(s) = i & pe|aer focayaa 


ie ele (A)dA 
ae same 


XK El (n/hQ) B |mel2(e* — Y)/LL — (1/A2)Nole#- 11}, (9) 
III. SIGNAL-TO-NOISE RATIO RESULTS 





From (7) and (9) we obtain the mean number of counts, (NV), 
emitted by the avalanche detector in a baud interval as follows 


6) 0 
(N) = aoe = aves] velar oat) 5 ~vo(s)| 


G[m? + LN, |n/iQ (10) 


II 
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where 
(L—-1) /2 


m = i: |m(é)|*dt= DY |m,z\|?, 
baud interval 


—(L—1)/2 


N. = (|nx|*) = classical spectral height of the incoherent noise 
at the detector input, 
BT +1, 


L2(B+1/T]T 


and G is the mean avalanche gain. 
The variance of the total number of counts is 


) 
s=0 


—— 2 z n \? 
24 LN.) —@ + (LN, + 2Nam?)(G)? (— 
(m? + ses ae + m?) (@) (=) (11) 


32 
—— My(s) 


(Nt) — WP = 





: M 
= (= n(s) 





s=0 


ww 











= eee ae . ONE. . 
shot noises beat noisest 


where G? is the mean square avalanche gain. 

Consider a typical twin-channel digital system, shown in Fig. 2. 
There is light incident on each detector containing ‘‘on-off’’ modulated 
signal pulses of duration 7 and incoherent noise. A channel is in the 
“on” state when its signal pulse has optical power p. In the “off” state 
the signal pulse power is p-HXT, where EXT is small compared to 
unity. During each baud interval, one or the other channel is ‘“‘on.”’ 
The detectors are assumed to have internal random gain (e.g., av- 
alanche gain or photomultiplier gain) and there are assumed to be 
thermal noises added to the detector outputs due to the amplifiers 
following the detectors. It is assumed that the signaling rate is slow 
enough so that each signal pulse of light of duration 7’ produces an 
output current from its detector of duration 7 that does not overlap 
with the currents from other pulses. The detector output current 
pulses plus the corresponding noises are integrated in each period T 
(or equivalently filtered). The output variable x is compared to the 
threshold after each integration to decide which channel is ‘‘on.’”’ An 
error is made if x > 0 when the “‘zero”’ channel is on, or vice-versa. 

The baseband noise-to-signal ratio is defined as the variance of the 
output voltage x divided by the square of the mean of the output 
voltage x. 


t The term ‘‘beat noise’ has been used in literature’ to describe those noise terms at 
the output of a square law detector which are due to fluctuations in the instantaneous 
power of a carrier which has a fluctuating amplitude. 
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where 


ké@ = Boltzman’s constant-absolute noise temperature referred 
to the integrator input. 
R = integrator equivalent thermal noise input resistance. 
ha = mean dark current counts per detector per interval 7 before 
avalanche gain. 
A, = mn/hQ = mean signal counts per interval 7 in ‘‘on’”’ chan- 
nel before avalanche gain. 
DX, = mean incoherent noise counts in either channel per baud 
interval 7 before avalanche gain. 
BT + 1, and equals the number of temporal modes de- 
tected. 
EXT = Signal power in “off” channel/signal power in ‘‘on”’ channel. 


L 


IV 


In eq. (12), terms which are due to the incoherent spontaneous 
emission noises of the quantum amplifiers (or background noise) are 
marked with arrows. 

We see that the optical incoherent noise, when detected to baseband, 
causes additional shot noise and also contributes two beat noise terms. 
One of these is proportional to the signal \, and one is proportional to 
L. One can use these signal-to-noise ratio results to approximate the 
error rate by assuming that the output variable x is roughly Gaussian 
in distribution. 

In the next section we shall generate some curves that may give a 
clearer picture of the effects of L, Xn, \s, etc., on performance. 
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IV. CHERNOV BOUNDS 


The moment generating function defined in (9) was used with 
previously published results!? on avalanche photo-diode gain statistics 
to obtain Chernov upper bounds on the energy per pulse required at 
the input of a digital twin-channel regenerating repeater of Fig. 2 to 
achieve a desired error rate as a function of the other parameters. 

The general Chernov bound is given as follows.’ Let X be a random 
variable with moment generating function M,(s). Let Pr,(« > y) be 
the probability that an outcome zx of X exceeds y. Then it follows that 


Pry,(z > vy) S el¥x()-s7] for s>O0 (13) 
where 


¥x(s) = In[M,(s)]. 


The bound is optimized for s such that (dy~(s)/ds) = y provided 
that value of s is greater than zero. 
Similarly, 
Pry(a@ < y) S el’x()-s71 for s <0 (14) 


where the optimal value of s is given by (d¥(s)/ds) = y provided that 
value of s is less than zero. 

To obtain Chernov bounds upon the probability of error for the 
twin-channel system of Fig. 2, one needs the moment generating 
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Fig. 3—Required energy per pulse normalized by 7/%© vs the incoherent noise level 
N, at the detector, also normalized by 7/f2. 
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Fig. 4—Same as Fig. 3. 


function of the output variable x. This can be obtained using (9) and 
the results of the Refs. 1 and 2, which are too detailed to duplicate here. 

From simple cases where error rates can be calculated exactly, the 
differences in required power between those results and the bounds 
are typically a few dB or less. Experimental results also confirm the 
tightness of the bounds. Therefore, in this paper we shall take the 
liberty of comparing the effects of various parameters upon the 
required energy per pulse to achieve a desired error rate by comparing 
the bounds. 

It was decided that the calculations should be presented graphically 
in two ways. 

First, in Figs. 3 to 5, the required energy per pulse normalized by 
n/hQ (i.e., the mean number of detected signal photons per pulse) is 
plotted vs the incoherent noise level NV, at the detector also normalized 
by 7/#Q. This is done for various values shown of L, mean avalanche 
gain, thermal noise, dark current, error rate, and extinction ratio, for a 
low-noise unilateral gain avalanche detector (i.e., a detector in which 
only one type of carrier causes ionizing collisions, and where carrier 
injection is from one end of the high field region). The avalanche gains 
used in these calculations do not minimize the required energy per 
pulse for the given values of the other parameters, but were used for 
illustration. 
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Fig. 5—Same as Fig. 3. 


It was recognized that in a hybrid system, if the loss between the 
regenerating repeater and the analog repeater closest to it is in- 
creased, then the signal energy per pulse at the regenerating repeater 
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Fig. 6—Required energy per pulse normalized by 7/f#2 vs the ratio Z of spontane- 
ous emission noise spectral height to signal energy. 
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Fig. 7—Same as Fig. 6. 


input will decrease while the ratio of signa] energy per pulse to spon- 
taneous emission noise spectral height at the regenerating repeater 
input will remain fixed. Thus in Figs. 6 to 8, the required energy per 
pulse normalized by 7/#Q is plotted vs the ratio Z of spontaneous 
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Fig. 8—Same as Fig. 6. 
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emission noise spectral height to signal energy, for the various values 
shown of other parameters. 


Vv. APPLICATIONS AND EXAMPLES 
5.1 Analog Repeaters 


Suppose one used quantum amplifiers in analog repeaters placed 
between regenerating repeaters so as to increase the distance between 
regenerating repeaters. See Fig. 1. Each quantum amplifier introduces 
a spontaneous emission noise which has spectral height referred to its 
input given by® 


G,=4 
Ninput = FAO (“-) (15) 
Gq 


where G, is the quantum amplifier power gain and F is a noise figure 
which can be near unity for good quantum amplifiers and is typically 
less than 10.‘ If the input to analog repeater k is a; nepers (in power) 
higher than the signal level at the input of the regenerating repeater, 
then the total spontaneous emission noise spectral height N, at the 
input of the regenerating repeater is 


ss Gar — 1 
No = > rag Ge) e-% (16) 
1 Gok 


where R = number of analog repeaters. 

The ratio of N, in (16) to the signal energy per pulse, p-7' at the 
regenerating repeater input, see Fig. 2, is the parameter Z defined in 
Section IV above. Since the incoherent noise and the signal both 
experience equal loss per unit length from the fiber, the ratio Z is 
constant between the regenerating repeater and the analog repeater 
closest to it. 


Example: Suppose we make the following assumptions. A twin-channel 
system is used with a unilateral gain detector having mean gain 100 
and with all the other parameter values necessary above so that the 
Chernov bound curves of Fig. 8 are applicable. The source is a 
Nd:YAIG laser having bandwidth 1 A at wavelength 1 um, ie., 
3-10! Hz. The modulation rate is 300 Mb/s so that T ~ 3.383 X 10-°s. 


t F is related to the population inversion in the amplifying medium which is as- 
sumed constant in this analysis. 
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We then have L = 100. There are 10 analog repeaters and they are 
spaced so that the signal level is the same at the input to each one. 

From Fig. 8 (assuming that the upper bounds are tight enough so 
that we can comment upon the effects of various parameters on the 
required energy per pulse by observing their effects upon the bounds') 
we see that when Z is less than 10-’, the required signal energy at the 
regenerator input is 600 counts, i.e., p-7 = hQ/n-600. This value of 
signal energy is the same as that which would be required if no sponta- 
neous emission noise were present (Z = 0). 

Thus for spontaneous emission noise to be negligible in this example, 
we must have the ratio of the signal energy per pulse at the regenerator 
input to N, larger than 10%. 

This means [from (16) ] that at the analog repeater inputs the signal 
level must exceed 103-7QRF[(G,—1)/G,] where 


R = number of analog repeaters = 10 in this example 

G, = gain of analog repeater (assumed the same for all repeaters) 

F = noise figure of an analog repeater (assumed the same for all 
repeaters). 


Looking again at Fig. 8, we see that for L = 100, Z can be as large 
as 5 X 10-3 before the required signal level at the regenerating repeater 
becomes large and enters the sensitive region. This means that the 
signal level at the inputs to the analog repeaters might be as low as 
200-2QRF[(G, — 1)/G,] in which case the signal required at the 
regenerating repeater is somewhat larger, but still not extremely 
sensitive to small changes in Z. Suppose F = 10, n = 1, G, = 100, and 
the maximum power output of any repeater is 1 mW. Suppose the loss 
of the medium is 10 dB/km. When spontaneous emission noise is 
negligible, we need 600 #2 = 1.2 X 1071* joules per pulse at the input 
to the regenerating repeater and we have 3.33 X 10-” joules per pulse 
at the output. Without analog repeaters we can have about 44.5 dB of 
loss or 4.45 km between regenerating repeaters. Suppose on the other 
hand we use 10 analog repeaters starting where the signal] level is 200 
hORFT(G, — 1)/G,] = 4 X 10-" joules per pulse (i.e., Z = 5 X 107%); 
or about 28.8 dB (2.88 km) from the regenerating repeater output. 
The string of 10 analog repeaters spaced at 20-dB intervals spans 200 
dB or 20 km of distance; and we can have an additional 13 dB or 1.3 km 
of distance to the next regenerating repeater input resulting in the 
required 2 X 107° joules per pulse at that regenerating repeater input. 


+t See comment Section IV. 
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The total distance between regenerating repeaters is now about 24.2 
km.* 

It seems prudent that for a given value of LZ, one should avoid values 
of Z which are so large that small changes in Z result in large changes 
in the required signal energy at the regenerating repeater. Such small 
changes in Z might come about if the source power or quantum ampli- 
fier gains fluctuated slightly. 


5.2 Regenerator Repeater Front End 


Suppose that in the example above we had just one quantum 
amplifier (or equivalently, the spontaneous emission noise from any 
additional quantum amplifiers was negligible). 

In the absence of spontaneous emission noise, the energy per pulse 
required at the regenerative repeater input is approximately 600 7Q/n. 
Now suppose we place the quantum amplifier immediately before the 
regenerating repeater. If the gain is sufficiently large, then we can 
operate with Z as large as 7-10-*. This means the energy per pulse at 
the input to the quantum amplifier need only be about 7QF/(7 X 107°) 
~ 140h0F (for large G,). Thus, we see that if 140F < 600/n, then the 
quantum amplifier increases the sensitivity of the regenerative repeater 
over that associated with an avalanche detector alone (in this example 
with L = 100). 

For other values of L in this example, the condition for a quantum 
amplifier front end to increase the regenerative repeater sensitivity is 


F 600 





< 
Zmax n 


where Zmax is the maximum value of Z for reasonable required energy 
per pulse at the input to the regenerating repeater (following the 
quantum amplifier). 

For other systems with different types of avalanche detectors and 
different parameters (avalanche gain, dark current, etc.) the number 
600 in the above equation should be replaced by the required mean 
number of detected counts in the absence of a quantum amplifier. 


+ A slightly larger total distance between regenerating repeaters can be obtained by 
starting the chain of analog repeaters 20 dB (rather than 28.8 dB) from the regener- 
ating repeater output. In that case Z & 5-107‘ and the next regenerating repeater can 
be about 45 dB from the last analog repeater for a total span of 24.5 km between re- 
generating repeaters. Placing the analog repeaters as described in the above example 
allows some margin for overload. 


QUANTUM AMPLIFIERS 131 


5.3 Background Noise 


As a final comment, it is clear from eq. (12) that if the incoherent 
noise spectral height, N., at the regenerating repeater input is small 
enough so that (n/#2)N.<« G?/(G@)? then only the additional shot noise 
term is important amongst the three noise terms associated with the 
incoherent noise. 

This inequality always holds for the case where the incoherent 
noise is background (thermal) radiation in equilibrium at temperatures 
below 104 °K, since for thermal background radiation we have 


AQ 
N.(Thermal) = —_——, 


ef 2/ké — | 


ké = Boltzman’s constant - absolute temperature. 
At room temperature and at a wavelength of 1 um, “0/ké ~ 50. 
Therefore, in analyses where incoherent background radiation is 
included, one usually only includes the additional shot noise term 
LN ,(n/nQ) = Dd, in the signal-to-noise ratio formulae. 


VI. CONCLUSIONS 


We have shown that quantum amplifiers can have applications in 
both analog repeaters to extend the distance between regenerating 
repeaters and as front ends of regenerating repeaters. Their usefulness 
is a function of the ratio of the optical bandwidth of the system to the 
modulation bandwidth; but is not limited to small values of this ratio. 
To choose system parameters, for example, the required signal levels 
at the analog and regenerating repeater inputs, various component 
parameters such as the mean avalanche gain, avalanche detector type, 
source bandwidth, baseband thermal noise, etc., must be given. 
Computations in addition to those presented, upper bounding the 
error rates, can be carried out with previously published Chernov 
bound results;! or approximate error-rate calculations can be made 
using the signal-to-noise ratio results of Section III above. 


APPENDIX 


Use of the Karhunen-Loéve Expression 


Starting with eq. (2) of the text, we could expand the received 
complex envelope e,(t) = m(t)e**t + n(t) in a baud interval in terms 
of the Karhunen-Loéve eigenfunctions of the band limited incoherent 
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noise n(t), i.e., define 


R.(t,u), {We(u)}, and {ve} 
as follows 
Ri(tu) = (n(t)n*(u)) 


i Vi(w)Ra(tsuj)du = yibx(t) for te baud interval, 
baud interval 


k = 1, 2,3--- 
Then 
er(t) = » anwx(t) for te(0,7) 
1 
where 
at = Me + Nk 
mM, = i m (tet tp, (4)dt 
baud interval 
Ne = | n(t)pi(t)dt 
baud interval 
(nunj) = Ya5e3, (nknj) = 0 
and 


i . Welty; (Edt = 43;. 
baud interval 


Then we would find that Mo(s) of eq. (9) could be more rigorously 
given by 


exp {v/a = {mae = 9 /]1 - 2 rater - » | 


Thus in eq. (9) N. has been rigorously replaced by y, for each k 
and the finite number of terms LZ has been replaced by an infinite 
number of terms. 

If we make the reasonable assumption that the incoherent noise 
is lat with spectral height N, in a band of width B’ + 1/T then 


Vk ~ Ns for ] 2 k = L 
= Q otherwise (17) 
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where 


L=BT +1. 


Thus the form for M¢(s) derived in the main text is identical to the 
more rigorous result under this approximation. 
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A Proper Model for Testing the Planarity 


of Electrical Circuits 


By A. J. GOLDSTEIN and D. G. SCHWEIKERT 
(Manuscript received August 25, 1972) 


The question of whether an electrical circuit can be laid out on a plane, 
without resorting to crossovers or multilayer wiring, 1s usually answered 
by testing the planarity of a graph representing the circuit. 

Two commonly used representations are shown to be inadequate. We 
present the following new representation, and show it to be complete and 
unrestrictive: The graph has one node for each circuit module, and one node 
for each net; for every net with k modules, there is a ‘‘star’’ of k edges 
connecting the net’s node to each of the modules of the net. 


I. INTRODUCTION 


Electrical networks frequently consist of a set of modules (beam- 
leaded chips, DIPs, ete.), and a set of electrical interconnections or 
‘nets’? among two or more modules. Each net specifies a set of modules 
to be interconnected with a single conducting path. The planar design 
problem consists of placing the modules and the net wiring in the 
plane. The question of whether the interconnections can be accomp- 
lished in the plane without resorting to crossovers or multilayer 
wiring is usually answered by testing the planarity of a graph represent- 
ing the circuit. 

This graph is typically constructed by one of two mappings: 


(¢) Module-to-Node Mapping. The modules are represented by 
the nodes (or points) of the graph; and the nets are represented 
by its edges (or lines); or 

(it) Module-to-Edge Mapping. The modules are represented by 
the edges and the nets are represented by the nodes. 


Since the edge of a graph connects exactly two nodes, these mappings 
are not uniquely defined and a priori design decisions must be made 
which may be either improper or restrictive, and may produce spurious 
crossovers (see Sections II and III). 
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We give a unique representation that maps both nets and modules 
into the nodes of a graph, G. In this representation, a k-module net 
will appear as a “star” with an edge from its node to each of the 
modules in the net. We show that this mapping is a complete and 
unrestrictive representation of an electrical circuit. The main result 
of this paper (Section IV) is that the network can be laid out in the 
plane without crossovers if and only if G is planar. Thus the practical 
problem of planarity of these networks is solved since there are good 
computer algorithms for testing planarity.!~> Such algorithms will do 
a good but not optimal job of minimizing crossovers in a nonplanar 
graph. As with other mappings, we are ignoring certain practical 
restrictions, such as a specified cyclic terminal order for a module. 
Usually, these restrictions can be forced on the graph by auxiliary 
strategies. 

The representation presented here is similar to that given by Engl 
and Mylnski:° in order to properly represent a k-node net, the conven- 
tional definition of an undirected edge, i.e., a set of two nodes, was 
generalized to a set of k nodes. We demonstrate here that such general- 
ized concepts are unnecessary. By mapping both nets and modules 
into nodes, we retain the conventional definition of an edge, which 
greatly simplifies the presentation and proof, and most importantly, 
permits the use of conventional planarity testing algorithms. 


II. INADEQUACY OF THE MODULE-TO-NODE MAPPING 


Since nets map into edges, and an edge connects exactly two nodes, 
there is an inherent restriction to two-module nets. A common embel- 
lishment of the module-to-node mapping, is to decompose a k-module 





Fig. 1—Planar circuit (ignoring dashed net). 
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Fig. 2a.—Planar graph constructed using the module-to-node mapping on Fig. 1. 
Note A’ and A” are adjacent on 1. : 

Fig. 2b—Alternative planar graph. Terminals for A’ and A” are not adjacent 
on 1. 


net (k > 2) into a string of k — 1 two-module nets.* Thus k — 2 of 
the modules are formally permitted to have two terminals contacting 
the same net. Since the cyclic order of edges leaving a node is irrelevant 
in deciding whether a graph is planar, these two terminals may not be 
adjacent in a planar layout of the graph. If not adjacent, these two 
terminals may necessitate a crossover inside the module; we will 
term this a “module crossing.” 

For example, the electrical circuit in Fig. 1 has a three-module net A. 
If A is represented as two two-module nets A’ (8, 1) and A” (1, 6) then 
the module-to-node mapping yields a graph having a planar layout 
shown in Fig. 2a. Since the A’ and A” terminals on 1 are adjacent, they 
can be merged, and planarity is legitimately indicated. 

However, this graph has a second, and equally acceptable, planar 
layout (see Fig. 2b) in which the A’ and A” terminals on Module | are 
not adjacent, and a physical realization (see Fig. 3) of this second layout 
may require an unnecessary crossover inside Module 1, i.e., a module 
crossing. 

If one adds the additional net (8, 5) (shown as a dashed line in Fig. 1) 
then the graph has only one planar layout (Fig. 2b), and that layout 
requires a module crossing for its physical realization (Fig. 3). 

These two examples demonstrate that the module-to-node mapping, 
by arbitrarily inserting two terminals per module for certain nets, 
cannot distinguish layouts which are physically planar from those 

* The use of the complete graph for k nodes (all pair-wise connections), commonly 
but inaccurately used’ in graphical representations for partitioning and placement 


algorithms, is clearly unacceptable here since the complete graph for five or more 
nodes is nonplanar. 
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Fig. 3—Physical implementation of graph in Fig. 2b. Note “module crossing” 
atl. 


which use module crossings. Furthermore, if a planar layout of the 
graph requires the use of module crossings, there may or may not be 
an alternative planar layout of the graph which does not require the 
use of module crossings. 

When a k-module net is decomposed into two-module connections, 
it is possible to choose a decomposition which will produce a nonplanar 
graph even though the circuit is planar. For example, the circuit in 
Fig. 4 is planar, and the module-to-node mapping will produce a 
planar graph if net A is decomposed into the string of three two-module 
nets: (1, 2), (2, 3), (8, 4). However, one may have chosen the alterna- 
tive decomposition (1, 3), (2, 4), (8, 4) which yields the nonplanar 
graph shown in Fig. 5. 

Certain technologies permit a limited amount of ‘under-module”’ 
wiring, which may permit the required module crossing in the above 
examples. However, even if this capability exists, there are two 





Fig. 4—Planar circuit. 
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A" 


Fig. 5—Nonplanar graph resulting from an inappropriate decomposition of Net A 
in Fig. 4. 


objections to the use of this mapping: (7) the set of two-module nets 
which best represent the k-module net (k > 2) is difficult to determine 
a priort, and an arbitrary choice may result in unnecessary crossovers; 
and (iz) unnecessary module crossings may result. 


III. INADEQUACY OF THE MODULE-TO-EDGE MAPPING 


A module which connects to k > 2 nets cannot be simply represented 
as a single edge. A typical elaboration of this mapping® is to represent 
a k-net module as a ring of k two-net modules. For example, the 
four-net Module 2 in the circuit above (see Fig. l-ignoring the dashed 
connection) could map into the four edges shown in Fig. 6a. With 
similar representations for Modules 1, 5, and 6, the module-to-edge 
mapping for this circuit has the planar layout shown in Fig. 7. 

However, without the obviously planar schematic in Fig. 1 for 
guidance, one may have arbitrarily chosen the equally acceptable 


(a) Oar 


2a 2c 2a 2c 


OO O*O 
(a) (b) 
Fig. 6—Alternative decompositions of Module 2 in Fig. 1. 


140 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1973 





Fig. 7—Planar graph constructed using the module-to-edge mapping on Fig. 1. 


representation of Module 2 shown in Fig. 6b. In this case, the graph is 
not planar. 

Basically, the representation of a k-net module (kK > 3) as a ring 
of edges, requires the specification of the sequence of terminals leaving 
a module-a specification which may not be required by the physical 
problem. As demonstrated in the above example, an arbitrary choice 
of terminal sequence may be restrictive and may yield a false indication 
of nonplanarity. 

For certain designs, where the modules are predesigned and the 
terminal sequence 7s specified, the choice of ring sequence is obvious 
and not a restriction, but a practical requirement. Note, however, 
that the ring may appear as a mirror image in the planar layout of the 
graph; where the module cannot be physically mirrored, additional 
restrictions are necessary. 


IV. MODULE-AND-NET-TO-NODE MAPPING 


The previous two mappings fail to produce graphs which always 
reflect the planarity aspects of the circuit. In this section, we construct 
a graph, G, to represent the circuit and show that the circuit is planar 
if and only if G is planar. The graph G constructed from the net infor- 
mation has one node for each module plus a “‘net node” for every net. 
For every k-module net there is a ‘‘star’’ of k edges connecting the net 
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Fig. 8—Planar graph constructed using the new module-and-net-to-node mapping 
on Fig. 1. 


node to each module in the net. Using this mapping, Fig. 8 shows the 
planar graph representing the circuit of Fig. 1. 

The star-like subgraph is selected somewhat arbitrarily and can be 
replaced by any tree attached to the net’s modules. Recall (Section I) 
that the cyclic order of edges at a module is unrestricted. 


Theorem: The circuit is planar if and only if G is planar. 


Proof: If G is planar, then clearly the circuit is planar. Conversely, 
suppose the circuit is planar. Consider the planar subgraph of any net. 
(Since they are electrically unnecessary, we may assume the subgraph 
has no loops.) We will modify it to form a star. First, create a node s at 
any point of the subgraph which is not a terminal. Continue to modify 
the subgraph by repeating the following process at s until a star 
subgraph results: (cf. Fig. 9). 


Choose an edge (s, t) of the modified subgraph with t having at 
least two edges. Let (t, u) be the first edge at t in, say, clockwise 
order from (t, s). Create a new subgraph by replacing the edge 
(t, u) by an edge (dashed in Fig. 9) from s to u “running parallel’ 
and on the left side of the path s, t, u. If t now has only two edges, 
then delete t and coalesce its two edges into one. 


Since the subgraph of the net was planar, the resulting star subgraph 
is also planar and has a node s corresponding to the net. By replacing 
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Fig. 9—Net A of planar circuit in Fig. 4. 


every net subgraph by a star subgraph, we obtain a planar graph, G, 
of the desired type. Q. E. D. 

Two observations may substantially reduce the size of the graph 
which is tested for planarity. Since a two-module net results in a star 
with only two edges, it is clear that planarity is unchanged if this net 
node is deleted and the two edges are coalesced into one. Similarly, a 
two-net module results in a node with only two edges connected to it; 
again, that module node can be deleted and the two edges coalesced 
into one. 
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